Recombination Site Nucleotide Sequences

attB1: 5'-ACAAGTTTGTACAAAAAAGCAGGCT-3'

attB2: 5'-ACCCAGCTTTCTTGTACAAAGTGGT-3'

attP1: 5'-TACAGGTCACTAATACCATCTAAGTAGTTGATTCATAGTGACTGGATATG
          TTGTGTTTTACAGTATTATGTAGTCTGTTTTTTATGCAAAATCTAATTTA
          ATATATTGATATTTATATCATTTTACGTTTCTCGTTCAGCTTTTTTGTAC
          AAAGTTGGCATTATAAAAAAGCATTGCTCATCAATTTGTTGCAACGAACA
          GGTCACTATCAGTCAAAATAAAATCATTATTTG-3'

attP2: 5'-CAAATAATGATTTTATTTTGACTGATAGTGACCTGTTCGTTGCAACAAAT
          TGATAAGCAATGCTTTCTTATAATGCCAACTTTGTACAAGAAAGCTGAAC
          GAGAAACGTAAAATGATATAAATATCAATATATTAAATTAGATTTTGCAT
          AAAAAACAGACTACATAATACTGTAAAACACAACATATCCAGTCACTATGA
          ATCAACTACTTAGATGGTATTAGTGACCTGTA-3'

attR1: 5'-ACAAGTTTGTACAAAAAAGCTGAACGAGAAACGTAAAATGATATAAA
          TATCAATATATTAAATTAGATTTTGCATAAAAAACAGACTACATAATAC
          TGTAAAACACAACATATCCAGTCACTATG-3'

attR2: 5'-GCAGGTCGACCATAGTGACTGGATATGTTGTGTTTTACAGTATTAT
          GTAGTCTGTTTTTTATGCAAAATCTAATTTAATATATTGATATTT
          ATATCATTTTACGTTTCTCGTTCAGCTTTCTTGTACAAAGTGGT-3'

attL1: 5'-CAAATAATGATTTTATTTTGACTGATAGTGACCTGTTCGTTGCAAC
          AAATTGATAAGCAATGCTTTTTTATAATGCCAACTTTGTACAAAAAAGCAGGCT-3'

attL2: 5'-CAAATAATGATTTTATTTTGACTGATAGTGACCTGTTCGTTGCAACAA
          ATTGATAAGCAATGCTTTCTTATAATGCCAACTTTGTACAAGAAAGCTGGGT-3'
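
The short attB1 site can be compared directly with the ends of the longer sites listed above. The following is a minimal Python sketch, not part of the original sheets; the helper names `common_prefix` and `common_suffix` are hypothetical, and only the 3' end of attL1 and the 5' end of attR1 are copied in. It simply reports which stretch of attB1 is shared with each neighbour.

```python
def common_prefix(a, b):
    """Return the longest prefix shared by strings a and b."""
    n = 0
    while n < min(len(a), len(b)) and a[n] == b[n]:
        n += 1
    return a[:n]


def common_suffix(a, b):
    """Return the longest suffix shared by strings a and b."""
    return common_prefix(a[::-1], b[::-1])[::-1]


ATTB1 = "ACAAGTTTGTACAAAAAAGCAGGCT"
ATTL1_END = "ATAATGCCAACTTTGTACAAAAAAGCAGGCT"      # 3' end of attL1 as listed
ATTR1_START = "ACAAGTTTGTACAAAAAAGCTGAACGAGAAACG"  # 5' end of attR1 as listed

left = common_prefix(ATTB1, ATTR1_START)   # part of attB1 shared with attR1
right = common_suffix(ATTB1, ATTL1_END)    # part of attB1 shared with attL1
# Stretch of attB1 covered by both comparisons.
shared = ATTB1[len(ATTB1) - len(right):len(left)]

print(len(left), left)
print(len(right), right)
print(shared)
```

The same two helpers can be pointed at attB2, attL2 and attR2 to repeat the comparison for the second site.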
Cloning sites of the Entry Vector pENTR1A
(reading frame A)

Restriction sites (left to right): Dra I, Xmn I, Sal I, BamH I, Kpn I, EcoR I

ACT TTG TAC AAA AAA GCA GGC TTT AAA GGA ACC AAT TCA GTC GAC TGG ATC CGG TAC CGA ATT C
TGA AAC ATG TTT TTT CGT CCG AAA TTT CCT TGG TTA AGT CAG CTG ACC TAG GCC ATG GCT TAA G
Thr Leu Tyr Lys Lys Ala Gly Phe Lys Gly Thr Asn Ser Val Asp Trp Ile Arg Tyr Arg Ile

ccdB gene

Restriction sites (left to right): EcoR I, Not I, Xho I, EcoR V

G AAT TCG CGG CCG CAC TCG AGA TAT CTA GAC CCA GCT TTC TTG TAC AAA
C TTA AGC GCC GGC GTG AGC TCT ATA GAT CTG GGT CGA AAG AAC ATG TTT
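
The amino acid line in the figure can be reproduced from the upper strand. The sketch below is an illustration only, assuming the standard genetic code; `translate` and `CODON_TABLE` are hypothetical names, and the input is the upper strand of the pENTR1A cloning region as shown above.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string in frame 1; spaces and a trailing partial codon are ignored."""
    dna = dna.upper().replace(" ", "")
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


top_strand = ("ACT TTG TAC AAA AAA GCA GGC TTT AAA GGA ACC AAT TCA "
              "GTC GAC TGG ATC CGG TAC CGA ATT C")
peptide = translate(top_strand)
print(peptide)  # one-letter codes: TLYKKAGFKGTNSVDWIRYRI
```

The one-letter output corresponds to Thr Leu Tyr Lys Lys Ala Gly Phe Lys Gly Thr Asn Ser Val Asp Trp Ile Arg Tyr Arg Ile.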
pENTR1A 2717 bp

Location (Base Nos.)    Gene Encoded
67..166                 attL1
321..626                ccdB
655..754                attL2
877..1686               KmR
1791..2364              ori
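
The base numbers in the table are 1-based and inclusive at both ends. A small sketch of how such coordinates map onto string slices and feature lengths follows; the names `FEATURES`, `feature_length` and `extract` are hypothetical, and the coordinates are the pENTR1A values from the table.

```python
# pENTR1A feature coordinates (1-based, inclusive), copied from the table.
FEATURES = {
    "attL1": (67, 166),
    "ccdB": (321, 626),
    "attL2": (655, 754),
    "KmR": (877, 1686),
    "ori": (1791, 2364),
}


def feature_length(start, end):
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def extract(sequence, start, end):
    """Slice a feature out of a sequence string using 1-based inclusive coordinates."""
    return sequence[start - 1:end]


lengths = {name: feature_length(*span) for name, span in FEATURES.items()}
for name, bp in lengths.items():
    print(f"{name:6s} {bp:4d} bp")

# Toy check of the slicing convention on a short string.
print(extract("ACGTACGT", 2, 4))  # CGT
```

Under this convention both att sites are 100 bp, and the ccdB and KmR spans are whole multiples of three bases.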



1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTTTAA AGGAACCAAT 
181 TCAGTCGACT GGATCCGGTA CCGAATTCGC TTACTAAAAG CCAGATAACA GTATGCGTAT
241 TTGCGCGCTG ATTTTTGCGG TATAAGAATA TATACTGATA TGTATACCCG AAGTATGTCA 
301 AAAAGAGGTG TGCTTCTAGA ATGCAGTTTA AGGTTTACAC CTATAAAAGA GAGAGCCGTT 
361 ATCGTCTGTT TGTGGATGTA CAGAGTGATA TTATTGACAC GCCCGGGCGA CGGATAGTGA 
421 TCCCCCTGGC CAGTGCACGT CTGCTGTCAG ATAAAGTCTC CCGTGAACTT TACCCGGTGG 
481 TGCATATCGG GGATGAAAGC TGGCGCATGA TGACCACCGA TATGGCCAGT GTGCCGGTCT 
541 CCGTTATCGG GGAAGAAGTG GCTGATCTCA GCCACCGCGA AAATGACATC AAAAACGCCA 
601 TTAACCTGAT GTTCTGGGGA ATATAGAATT CGCGGCCGCA CTCGAGATAT CTAGACCCAG 
661 CTTTCTTGTA CAAAGTTGGC ATTATAAGAA AGCATTGCTT ATCAATTTGT TGCAACGAAC 
721 AGGTCACTAT CAGTCAAAAT AAAATCATTA TTTGCCATCC AGCTGCAGCT CTGGCCCGTG
781 TCTCAAAATC TCTGATGTTA CATTGCACAA GATAAAAATA TATCATCATG AACAATAAAA 
841 CTGTCTGCTT ACATAAACAG TAATACAAGG GGTGTTATGA GCCATATTCA ACGGGAAACG 
901 TCGAGGCCGC GATTAAATTC CAACATGGAT GCTGATTTAT ATGGGTATAA ATGGGCTCGC 
961 GATAATGTCG GGCAATCAGG TGCGACAATC TATCGCTTGT ATGGGAAGCC CGATGCGCCA 
1021 GAGTTGTTTC TGAAACATGG CAAAGGTAGC GTTGCCAATG ATGTTACAGA TGAGATGGTC 
1081 AGACTAAACT GGCTGACGGA ATTTATGCCT CTTCCGACCA TCAAGCATTT TATCCGTACT 
1141 CCTGATGATG CATGGTTACT CACCACTGCG ATCCCCGGAA AAACAGCATT CCAGGTATTA 
1201 GAAGAATATC CTGATTCAGG TGAAAATATT GTTGATGCGC TGGCAGTGTC CCTGCGCCGG 
1261 TTGCATTCGA TTCCTGTTTG TAATTGTCCT TTTAACAGCG ATCGCGTATT TCGTCTCGCT
1321 CAGGCGCAAT CACGAATGAA TAACGGTTTG GTTGATGCGA GTGATTTTGA TGACGAGCGT
1381 AATGGCTGGC CTGTTGAACA AGTCTGGAAA GAAATGCATA AACTTTTGCC ATTCTCACCG
1441 GATTCAGTCG TCACTCATGG TGATTTCTCA CTTGATAACC TTATTTTTGA CGAGGGGAAA
1501 TTAATAGGTT GTATTGATGT TGGACGAGTC GGAATCGCAG ACCGATACCA GGATCTTGCC
1561 ATCCTATGGA ACTGCCTCGG TGAGTTTTCT CCTTCATTAC AGAAACGGCT TTTTCAAAAA 
1621 TATGGTATTG ATAATCCTGA TATGAATAAA TTGCAGTTTC ATTTGATGCT CGATGAGTTT 
1681 TTCTAATCAG AATTGGTTAA TTGGTTGTAA CATTATTCAG ATTGGGCCCC GTTCCACTGA
1741 GCGTCAGACC CCGTAGAAAA GATCAAAGGA TCTTCTTGAG ATCCTTTTTT TCTGCGCGTA
1801 ATCTGCTGCT TGCAAACAAA AAAACCACCG CTACCAGCGG TGGTTTGTTT GCCGGATCAA
1861 GAGCTACCAA CTCTTTTTCC GAAGGTAACT GGCTTCAGCA GAGCGCAGAT ACCAAATACT 
1921 GTTCTTCTAG TGTAGCCGTA GTTAGGCCAC CACTTCAAGA ACTCTGTAGC ACCGCCTACA 
1981 TACCTCGCTC TGCTAATCCT GTTACCAGTG GCTGCTGCCA GTGGCGATAA GTCGTGTCTT 
2041 ACCGGGTTGG ACTCAAGACG ATAGTTACCG GATAAGGCGC AGCGGTCGGG CTGAACGGGG 
2101 GGTTCGTGCA CACAGCCCAG CTTGGAGCGA ACGACCTACA CCGAACTGAG ATACCTACAG 
2161 CGTGAGCTAT GAGAAAGCGC CACGCTTCCC GAAGGGAGAA AGGCGGACAG GTATCCGGTA 
2221 AGCGGCAGGG TCGGAACAGG AGAGCGCACG AGGGAGCTTC CAGGGGGAAA CGCCTGGTAT
2281 CTTTATAGTC CTGTCGGGTT TCGCCACCTC TGACTTGAGC GTCGATTTTT GTGATGCTCG
2341 TCAGGGGGGC GGAGCCTATG GAAAAACGCC AGCAACGCGG CCTTTTTACG GTTCCTGGCC
2401 TTTTGCTGGC CTTTTGCTCA CATGTTCTTT CCTGCGTTAT CCCCTGATTC TGTGGATAAC 
2461 CGTATTACCG CTAGCATGGA TCTCGGGGAC GTCTAACTAC TAAGCGAGAG TAGGGAACTG 
2521 CCAGGCATCA AATAAAACGA AAGGCTCAGT CGGAAGACTG GGCCTTTCGT TTTATCTGTT 
2581 GTTTGTCGGT GAACGCTCTC CTGAGTAGGA CAAATCCGCC GGGAGCGGAT TTGAACGTTG 
2641 TGAAGCAACG GCCCGGAGGG TGGCGGGCAG GACGCCCGCC ATAAACTGCC AGGCATCAAA 
2701 CTAAGCAGAA GGCCATC 
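
Each line of the listing carries the position of its first base followed by groups of ten bases. A minimal sketch of reading such lines back into one continuous string is given below; `parse_numbered_sequence` is a hypothetical helper, and the two input lines are the last two lines of the pENTR1A listing.

```python
def parse_numbered_sequence(text):
    """Join numbered, space-grouped sequence lines into (first_position, sequence)."""
    first_position = None
    chunks = []
    for line in text.strip().splitlines():
        fields = line.split()
        if not fields:
            continue
        if first_position is None:
            first_position = int(fields[0])  # position label of the first line
        chunks.append("".join(fields[1:]))   # drop the label, join the base groups
    return first_position, "".join(chunks)


listing = """
2641 TGAAGCAACG GCCCGGAGGG TGGCGGGCAG GACGCCCGCC ATAAACTGCC AGGCATCAAA
2701 CTAAGCAGAA GGCCATC
"""

start, seq = parse_numbered_sequence(listing)
last_base = start + len(seq) - 1
print(start, len(seq), last_base)
```

The computed last position agrees with the 2717 bp length stated for pENTR1A.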
Cloning Sites of the Entry Vector pENTR2B (reading frame B)

Features and restriction sites (left to right): Int, attL1, Ehe I, Xmn I, Sal I, BamH I, Kpn I, EcoR I

TTG TAC AAA AAA GCA GGC TGG CGC CGG AAC CAA TTC AGT CGA CTG GAT CCG GTA CCG AAT TC - ccdB -
AAC ATG TTT TTT CGT CCG ACC GCG GCC TTG GTT AAG TCA GCT GAC CTA GGC CAT GGC TTA AG
Leu Tyr Lys Lys Ala Gly Trp Arg Arg Asn Gln Phe Ser Arg Leu Asp Pro Val Pro Asn

Features and restriction sites (left to right): EcoR I, Not I, Xho I, EcoR V, Xba I, Int, attL2

G AAT TCG CGG CCG CAC TCG AGA TAT CTA GAC CCA GCT TTC TTG TAC AAA G
C TTA AGC GCC GGC GTG AGC TCT ATA GAT CTG GGT CGA AAG AAC ATG TTT C
  Asn Ser Arg Pro His Ser Arg Tyr Leu Asp Pro Ala Phe Leu Tyr Lys
pENTR2B 2718 bp

Location (Base Nos.)    Gene Encoded
67..166                 attL1
322..627                ccdB
656..755                attL2
878..1687               KmR
1792..2365              ori

1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT 
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTGGCG CCGGAACCAA 
181 TTCAGTCGAC TGGATCCGGT ACCGAATTCG CTTACTAAAA GCCAGATAAC AGTATGCGTA 
241 TTTGCGCGCT GATTTTTGCG GTATAAGAAT ATATACTGAT ATGTATACCC GAAGTATGTC 
301 AAAAAGAGGT GTGCTTCTAG AATGCAGTTT AAGGTTTACA CCTATAAAAG AGAGAGCCGT 
361 TATCGTCTGT TTGTGGATGT ACAGAGTGAT ATTATTGACA CGCCCGGGCG ACGGATGGTG 
421 ATCCCCCTGG CCAGTGCACG TCTGCTGTCA GATAAAGTCT CCCGTGAACT TTACCCGGTG 
481 GTGCATATCG GGGATGAAAG CTGGCGCATG ATGACCACCG ATATGGCCAG TGTGCCGGTC 
541 TCCGTTATCG GGGAAGAAGT GGCTGATCTC AGCCACCGCG AAAATGACAT CAAAAACGCC 
601 ATTAACCTGA TGTTCTGGGG AATATAGAAT TCGCGGCCGC ACTCGAGATA TCTAGACCCA 
661 GCTTTCTTGT ACAAAGTTGG CATTATAAGA AAGCATTGCT TATCAATTTG TTGCAACGAA 
721 CAGGTCACTA TCAGTCAAAA TAAAATCATT ATTTGCCATC CAGCTGCAGC TGTGGCCCGT 
781 GTCTCAAAAT CTCTGATGTT ACATTGCACA AGATAAAAAT ATATCATCAT GAACAATAAA 
841 ACTGTCTGCT TACATAAACA GTAATACAAG GGGTGTTATG AGCCATATTC AACGGGAAAC 
901 GTCGAGGCCG CGATTAAATT CCAACATGGA TGCTGATTTA TATGGGTATA AATGGGCTCG 
961 CGATAATGTC GGGCAATCAG GTGCGACAAT CTATCGCTTG TATGGGAAGC CCGATGCGCC 
1021 AGAGTTGTTT CTGAAACATG GCAAAGGTAG CGTTGCCAAT GATGTTACAG ATGAGATGGT 
1081 CAGACTAAAC TGGCTGACGG AATTTATGCC TCTTCCGACC ATCAAGCATT TTATCCGTAC 
1141 TCCTGATGAT GCATGGTTAC TCACCACTGC GATCCCCGGA AAAACAGCAT TCCAGGTATT 
1201 AGAAGAATAT CCTGATTCAG GTGAAAATAT TGTTGATGCG CTGGCAGTGT TCCTGCGCCG 
1261 GTTGCATTCG ATTCCTGTTT GTAATTGTCC TTTTAACAGC GATCGCGTAT TTCGTCTCGC 
1321 TCAGGCGCAA TCACGAATGA ATAACGGTTT GGTTGATGCG AGTGATTTTG ATGACGAGCG 
1381 TAATGGCTGG CCTGTTGAAC AAGTCTGGAA AGAAATGCAT AAACTTTTGC CATTCTCACC 
1441 GGATTCAGTC GTCACTCATG GTGATTTCTC ACTTGATAAC CTTATTTTTG ACGAGGGGAA 
1501 ATTAATAGGT TGTATTGATG TTGGACGAGT CGGAATCGCA GACCGATACC AGGATCTTGC
1561 CATCCTATGG AACTGCCTCG GTGAGTTTTC TCCTTCATTA CAGAAACGGC TTTTTCAAAA 
1621 ATATGGTATT GATAATCCTG ATATGAATAA ATTGCAGTTT CATTTGATGC TCGATGAGTT 
1681 TTTCTAATCA GAATTGGTTA ATTGGTTGTA ACATTATTCA GATTGGGCCC CGTTCCACTG
1741 AGCGTCAGAC CCCGTAGAAA AGATCAAAGG ATCTTCTTGA GATCCTTTTT TTCTGCGCGT
1801 AATCTGCTGC TTGCAAACAA AAAAACCACC GCTACCAGCG GTGGTTTGTT TGCCGGATCA
1861 AGAGCTACCA ACTCTTTTTC CGAAGGTAAC TGGCTTCAGC AGAGCGCAGA TACCAAATAC 
1921 TGTTCTTCTA GTGTAGCCGT AGTTAGGCCA CCACTTCAAG AACTCTGTAG CACCGCCTAC 
1981 ATACCTCGCT CTGCTAATCC TGTTACCAGT GGCTGCTGCC AGTGGCGATA AGTCGTGTCT 
2041 TACCGGGTTG GACTCAAGAC GATAGTTACC GGATAAGGCG CAGCGGTCGG GCTGAACGGG 
2101 GGGTTCGTGC ACACAGCCCA GCTTGGAGCG AACGACCTAC ACCGAACTGA GATACCTACA 
2161 GCGTGAGCTA TGAGAAAGCG CCACGCTTCC CGAAGGGAGA AAGGCGGACA GGTATCCGGT 
2221 AAGCGGCAGG GTCGGAACAG GAGAGCGCAC GAGGGAGCTT CCAGGGGGAA ACGCCTGGTA
2281 TCTTTATAGT CCTGTCGGGT TTCGCCACCT CTGACTTGAG CGTCGATTTT TGTGATGCTC
2341 GTCAGGGGGG CGGAGCCTAT GGAAAAACGC CAGCAACGCG GCCTTTTTAC GGTTCCTGGC 
2401 CTTTTGCTGG CCTTTTGCTC ACATGTTCTT TGCTGCGTTA TCCCCTGATT CTGTGGATAA 
2461 CCGTATTACC GCTAGCATGG ATCTCGGGGA CGTCTAACTA CTAAGCGAGA GTAGGGAACT 
2521 GCCAGGCATC AAATAAAACG AAAGGCTCAG TCGGAAGACT GGGCCTTTCG TTTTATCTGT 
2581 TGTTTGTCGG TGAACGCTCT CCTGAGTAGG ACAAATCCGC CGGGAGCGGA TTTGAACGTT 
2641 GTGAAGCAAC GGCCCGGAGG GTGGCGGGCA GGACGCCCGC CATAAACTGC CAGGCATCAA 
2701 ACTAAGCAGA AGGCCATC 

FIG.11B
Cloning Sites of the Entry Vector pENTR3C (reading frame C)

Features and restriction sites (left to right): Int, attL1, Dra I, Xmn I, Sal I, BamH I, Kpn I, EcoR I, Pvu I

TTG TAC AAA AAA GCA GGC TCT TTA AAG GAA CCA ATT CAG TCG ACT GGA TCC GGT ACC GAA TTC GAT CGC - ccdB -
AAC ATG TTT TTT CGT CCG AGA AAT TTC CTT GGT TAA GTC AGC TGA CCT AGG CCA TGG CTT AAG CTA GCG
Leu Tyr Lys Lys Ala Gly Ser Leu Lys Glu Pro Ile Gln Ser Thr Gly Ser Gly Thr Glu Phe

Features and restriction sites (left to right): EcoR I, Not I, Xho I, EcoR V, Xba I, attL2, Int

G AAT TCG CGG CCG CAC TCG AGA TAT CTA GAC CCA GCT TTC TTG TAC AAA G
C TTA AGC GCC GGC GTG AGC TCT ATA GAT CTG GGT CGA AAG AAC ATG TTT C
  Asn Ser Arg Pro His Ser Arg Tyr Leu Asp Pro Ala Phe Leu Tyr Lys
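
The three vectors labelled reading frame A, B and C differ in how many bases separate the attL1 codons from the cloning sites. The sketch below is an illustration only; `bamhi_phase` is a hypothetical helper, the linker strings are copied from the pENTR1A, pENTR2B and pENTR3C listings (bases 148 onward), and the BamH I site is used as an arbitrary landmark.

```python
ANCHOR = "TTGTACAAAAAAGCAGGC"  # Leu Tyr Lys Lys Ala Gly codons at the attL1 end
BAMHI = "GGATCC"

LINKERS = {
    "pENTR1A": "TTGTACAAAAAAGCAGGCTTTAAAGGAACCAATTCAGTCGACTGGATCCGGTACC",
    "pENTR2B": "TTGTACAAAAAAGCAGGCTGGCGCCGGAACCAATTCAGTCGACTGGATCCGGTACC",
    "pENTR3C": "TTGTACAAAAAAGCAGGCTCTTTAAAGGAACCAATTCAGTCGACTGGATCCGGTACC",
}


def bamhi_phase(linker):
    """Offset (mod 3) of the BamH I site relative to the first attL1 codon shown."""
    start = linker.index(ANCHOR)
    site = linker.index(BAMHI, start)
    return (site - start) % 3


phases = {name: bamhi_phase(seq) for name, seq in LINKERS.items()}
print(phases)
```

Each vector gives a different remainder, so an insert cloned at the same restriction site is read in a different frame in each of the three vectors.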
pENTR3C 2723 bp

Location (Base Nos.)    Gene Encoded
67..166                 attL1
327..632                ccdB
661..760                attL2
883..1692               KmR
1797..2370              ori

1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC 
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT 
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTCTTT AAAGGAACCA 
181 ATTCAGTCGA CTGGATCCGG TACCGAATTC GATCGCTTAC TAAAAGCCAG ATAACAGTAT 
241 GCGTATTTGC GCGCTGATTT TTGCGGTATA AGAATATATA CTGATATGTA TACCCGAAGT
301 ATGTCAAAAA GAGGTGTGCT TCTAGAATGC AGTTTAAGGT TTACACCTAT AAAAGAGAGA 
361 GCCGTTATCG TCTGTTTGTG GATGTACAGA GTGATATTAT TGACACGCCC GGGCGACGGA 
421 TGGTGATCCC CCTGGCCAGT GCACGTCTGC TGTCAGATAA AGTCTCCCGT GAACTTTACC
481 CGGTGGTGCA TATCGGGGAT GAAAGCTGGC GCATGATGAC CACCGATATG GCCAGTGTGC
541 CGGTCTCCGT TATCGGGGAA GAAGTGGCTG ATCTCAGCCA CCGCGAAAAT GACATCAAAA 
601 ACGCCATTAA CCTGATGTTC TGGGGAATAT AGAATTCGCG GCCGCACTCG AGATATCTAG 
661 ACCCAGCTTT CTTGTACAAA GTTGGCATTA TAAGAAAGCA TTGCTTATCA ATTTGTTGCA
721 ACGAACAGGT CACTATCAGT CAAAATAAAA TCATTATTTG CCATCCAGCT GCAGCTCTGG 
781 CCCGTGTCTC AAAATCTCTG ATGTTACATT GCACAAGATA AAAATATATC ATCATGAACA 
841 ATAAAACTGT CTGCTTACAT AAACAGTAAT ACAAGGGGTG TTATGAGCCA TATTCAACGG 
901 GAAACGTCGA GGCCGCGATT AAATTCCAAC ATGGATGCTG ATTTATATGG GTATAAATCG 
961 GCTCGCGATA ATGTCGGGCA ATCAGGTGCG ACAATCTATC GCTTGTATGG GAAGCCCGAT 
1021 GCGCCAGAGT TGTTTCTGAA ACATGGCAAA GGTAGCGTTG CCAATGATGT TACAGATGAG
1081 ATGGTCAGAC TAAACTGGCT GACGGAATTT ATGCCTCTTC CGACCATCAA GCATTTTATC
1141 CGTACTCCTG ATGATGCATG GTTACTCACC ACTGCGATCC CCGGAAAAAC AGCATTCCAG 
1201 GTATTAGAAG AATATCCTGA TTCAGGTGAA AATATTGTTG ATGCGCTGGC AGTGTTCCTG 
1261 CGCCGGTTGC ATTCGATTCC TGTTTGTAAT TGTCCTTTTA ACAGCGATCG CGTATTTCGT
1321 CTCGCTCAGG CGCAATCACG AATGAATAAC GGTTTGGTTG ATGCGAGTGA TTTTGATGAC 
1381 GAGCGTAATG GCTGGCCTGT TGAACAAGTC TGGAAAGAAA TGCATAAACT TTTGCCATTC 
1441 TCACCGGATT CAGTCGTCAC TCATGGTGAT TTCTCACTTG ATAACCTTAT TTTTGACGAG 
1501 GGGAAATTAA TAGGTTGTAT TGATGTTGGA CGAGTCGGAA TCGCAGACCG ATACCAGGAT 
1561 CTTGCCATCC TATGGAACTG CCTCGGTGAG TTTTCTCCTT CATTACAGAA ACGGCTTTTT 
1621 CAAAAATATG GTATTGATAA TCCTGATATG AATAAATTGG AGTTTCATTT GATGCTCGAT 
1681 GAGTTTTTCT AATCAGAATT GGTTAATTGG TTGTAACATT ATTCAGATTG GGCCCCGTTC
1741 CACTGAGCGT CAGACCCCGT AGAAAAGATC AAAGGATCTT CTTGAGATCC TTTTTTTCTG
1801 CGCGTAATCT GCTGCTTGCA AACAAAAAAA CCACCGCTAC CAGCGGTGGT TTGTTTGCCG 
1861 GATCAAGAGC TACCAACTCT TTTTCCGAAG GTAACTGGCT TCAGCAGAGC GCAGATACCA 
1921 AATACTGTTC TTCTAGTGTA GCCGTAGTTA GGCCACCACT TCAAGAACTC TGTAGCACCG 
1981 CCTACATACC TCGCTCTGCT AATCCTGTTA CCAGTGGCTG CTGCCAGTGG CGATAAGTCG 
2041 TGTCTTACCG GGTTGGACTC AAGACGATAG TTACCGGATA AGGCGCAGCG GTCGGGCTGA
2101 ACGGGGGGTT CGTGCACACA GCCCAGCTTG GAGCGAACGA CCTACACCGA ACTGAGATAC 
2161 CTACAGCGTG AGCTATGAGA AAGCGCCACG CTTCCCGAAG GGAGAAAGGC GGACAGGTAT 
2221 CCGGTAAGCG GCAGGGTCGG AACAGGAGAG CGCACGAGGG AGCTTCCAGG GGGAAACGCC
2281 TGGTATCTTT ATAGTCCTGT CGGGTTTCGC CACCTCTGAC TTGAGCGTCG ATTTTTGTGA 
2341 TGCTCGTCAG GGGGGCGGAG CCTATGGAAA AACGCCAGCA ACGCGGCCTT TTTACGGTTC 
2401 CTGGCCTTTT GCTGGCCTTT TGCTCACATG TTCTTTCCTG CGTTATCCCC TGATTCTGTG
2461 GATAACCGTA TTACCGCTAG CATGGATCTC GGGGACGTCT AACTACTAAG CGAGAGTAGG 
2521 GAACTGCCAG GCATCAAATA AAACGAAAGG CTCAGTCGGA AGACTGGGCC TTTCGTTTTA 
2581 TCTGTTGTTT GTCGGTGAAC GCTCTCCTGA GTAGGACAAA TCCGCCGGGA GCGGATTTGA 
2641 ACGTTGTGAA GCAACGGCCC GGAGGGTGGC GGGCAGGACG CCCGCCATAA ACTGCCAGGC 
2701 ATCAAACTAA GCAGAAGGCC ATC 

FIG.12B
Cloning Sites of the Entry Vector pENTR4

Features and restriction sites (left to right): Int, attL1, Nco I, Kozak, Xmn I, Sal I, BamH I, Kpn I, EcoR I

TTG TAC AAA AAA GCA GGC TCC ACC ATG GGA ACC AAT TCA GTC GAC TGG ATC CGG TAC CGA ATT C - ccdB -
AAC ATG TTT TTT CGT CCG AGG TGG TAC CCT TGG TTA AGT CAG CTG ACC TAG GCC ATG GCT TAA G
Leu Tyr Lys Lys Ala Gly Ser Thr Met Gly Thr Asn Ser Val Asp Trp Ile Arg Tyr Arg Ile

Features and restriction sites (left to right): EcoR I, Not I, Xho I, EcoR V, Xba I, Int, attL2

G AAT TCG CGG CCG CAC TCG AGA TAT CTA GAC CCA GCT TTC TTG TAC AAA G
C TTA AGC GCC GGC GTG AGC TCT ATA GAT CTG GGT CGA AAG AAC ATG TTT C
  Asn Ser Arg Pro His Ser Arg Tyr Leu Asp Pro Ala Phe Leu Tyr Lys

FIG.13A
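
The site labels in FIG.13A can be located in the sequence by simple pattern matching. The following sketch is an illustration, not part of the original sheets; `find_sites` is a hypothetical helper, the recognition sequences are the standard ones for these enzymes, and the polylinker string is bases 148 to 211 of the pENTR4 listing.

```python
import re

# Recognition sequences written as regular expressions (N = any base for Xmn I).
SITES = {
    "Nco I": "CCATGG",
    "Xmn I": "GAA[ACGT]{4}TTC",
    "Sal I": "GTCGAC",
    "BamH I": "GGATCC",
    "Kpn I": "GGTACC",
    "EcoR I": "GAATTC",
}

POLYLINKER = "TTGTACAAAAAAGCAGGCTCCACCATGGGAACCAATTCAGTCGACTGGATCCGGTACCGAATTC"


def find_sites(sequence, sites):
    """Return the 1-based start of the first match for each recognition sequence."""
    hits = {}
    for name, pattern in sites.items():
        match = re.search(pattern, sequence)
        if match:
            hits[name] = match.start() + 1
    return hits


positions = find_sites(POLYLINKER, SITES)
for name, pos in sorted(positions.items(), key=lambda item: item[1]):
    print(f"{name:7s} starts at base {pos} of the polylinker")
```

Sorting the hits by position reproduces the left-to-right order of the labels in the figure.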
pENTR4 2720 bp

Location (Base Nos.)    Gene Encoded
67..166                 attL1
324..629                ccdB
658..757                attL2
880..1689               KmR
1794..2367              ori



1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTCCAC CATGGGAACC
181 AATTCAGTCG ACTGGATCCG GTACCGAATT CGCTTACTAA AAGCCAGATA ACAGTATGCG 
241 TATTTGCGCG CTGATTTTTG CGGTATAAGA ATATATACTG ATATGTATAC CCGAAGTATG 
301 TCAAAAAGAG GTGTGCTTCT AGAATGCAGT TTAAGGTTTA CACCTATAAA AGAGAGAGCC 
361 GTTATCGTCT GTTTGTGGAT GTACAGAGTG ATATTATTGA CACGCCCGGG CGACGGATGG 
421 TGATCCCCCT GGCCAGTGCA CGTCTGCTGT CAGATAAAGT CTCCCGTGAA CTTTACCCGG 
481 TGGTGCATAT CGGGGATGAA AGCTGGCGCA TGATGACCAC CGATATGGCC AGTGTGCCGG 
541 TCTCCGTTAT CGGGGAAGAA GTGGCTGATC TCAGCCACCG CGAAAATGAC ATCAAAAACG 
601 CCATTAACCT GATGTTCTGG GGAATATAGA ATTCGCGGCC GCACTCGAGA TATCTAGACC 
661 CAGCTTTCTT GTACAAAGTT GGCATTATAA GAAAGCATTG CTTATCAATT TGTTGCAACG 
721 AACAGGTCAC TATCAGTCAA AATAAAATCA TTATTTGCCA TCCAGCTGCA GCTCTGGCCC 
781 GTGTCTCAAA ATCTCTGATG TTACATTGCA CAAGATAAAA ATATATCATC ATGAACAATA 
841 AAACTGTCTG CTTACATAAA CAGTAATACA AGGGGTGTTA TGAGCCATAT TCAACGGGAA 
901 ACGTCGAGGC CGCGATTAAA TTCCAACATG GATGCTGATT TATATGGGTA TAAATGGGCT 
961 CGCGATAATG TCGGGCAATC AGGTGCGACA ATCTATCGCT TGTATGGGAA GCCCGATGCG 
1021 CCAGAGTTGT TTCTGAAACA TGGCAAAGGT AGCGTTGCCA ATGATGTTAC AGATGAGATG
1081 GTCAGACTAA ACTGGCTGAC GGAATTTATG CCTCTTCCGA CCATCAAGCA TTTTATCCGT 
1141 ACTCCTGGTG ATGCATGGTT ACTCACCACT GCGATCCCCG GAAAAACAGC ATTCCAGGTA 
1201 TTAGAAGAAT ATCCTGATTC AGGTGAAAAT ATTGTTGATG CGCTGGCAGT GTTCCTGCGC 
1261 CGGTTGCATT CGATTCCTGT TTGTAATTGT CCTTTTAACA GCGATCGCGT ATTTCGTCTC 
1321 GCTCAGGCGC AATCACGAAT GAATAACGGT TTGGTTGATG CGAGTGATTT TGATGACGAG 
1381 CGTAATGGCT GGCCTGTTGA ACAAGTCTGG AAAGAAATGC ATAAACTTTT GCCATTCTCA
1441 CCGGATTCAG TCGTCACTCA TGGTGATTTC TCACTTGATA ACCTTATTTT TGACGAGGGG
1501 AAATTAATAG GTTGTATTGA TGTTGGACGA GTCGGAATCG CAGACCGATA CCAGGATCTT
1561 GCCATCCTAT GGAACTGCCT CGGTGAGTTT TCTCCTTCAT TACAGAAACG GCTTTTTCAA
1621 AAATATGGTA TTGATAATCC TGATATGAAT AAATTGCAGT TTCATTTGAT GCTCGATGAG
1681 TTTTTCTAAT CAGAATTGGT TAATTGGTTG TAACATTATT CAGATTGGGC CCCGTTCCAC
1741 TGAGCGTCAG ACCCCGTAGA AAAGATCAAA GGATCTTCTT GAGATCCTTT TTTTCTGCGC 
1801 GTAATCTGCT GCTTGCAAAC AAAAAAACCA CCGCTACCAG CGGTGGTTTG TTTGCCGGAT 
1861 CAAGAGCTAC CAACTCTTTT TCCGAAGGTA ACTGGCTTCA GCAGAGCGCA GATACCAAAT 
1921 ACTGTTCTTC TAGTGTAGCC GTAGTTAGGC CACCACTTCA AGAACTCTGT AGCACCGCCT 
1981 ACATACCTCG CTCTGCTAAT CCTGTTACCA GTGGCTGCTG CCAGTGGCGA TAAGTCGTGT 
2041 CTTACCGGGT TGGACTCAAG ACGATAGTTA CCGGATAAGG CGCAGCGGTC GGGCTGAACG 
2101 GGGGGTTCGT GCACACAGCC CAGCTTGGAG CGAACGACCT ACACCGAACT GAGATACCTA 
2161 CAGCGTGAGC TATGAGAAAG CGCCACGCTT CCCGAAGGGA GAAAGGCGGA CAGGTATCCG 
2221 GTAAGCGGCA GGGTCGGAAC AGGAGAGCGC ACGAGGGAGC TTCCAGGGGG AAACGCCTGG 
2281 TATCTTTATA GTCCTGTCGG GTTTCGCCAC CTCTGACTTG AGCGTCGATT TTTGTGATGC
2341 TCGTCAGGGG GGCGGAGCCT ATGGAAAAAC GCCAGCAACG CGGCCTTTTT ACGGTTCCTG
2401 GCCTTTTGCT GGCCTTTTGC TCACATGTTC TTTCCTGCGT TATCCCCTGA TTCTGTGGAT
2461 AACCGTATTA CCGCTAGCAT GGATCTCGGG GACGTCTAAC TACTAAGCGA GAGTAGGGAA
2521 CTGCCAGGCA TCAAATAAAA CGAAAGGCTC AGTCGGAAGA CTGGGCCTTT CGTTTTATCT 
2581 GTTGTTTGTC GGTGAACGCT CTCCTGAGTA GGACAAATCC GCCGGGAGCG GATTTGAACG 
2641 TTGTGAAGCA ACGGCCCGGA GGGTGGCGGG CAGGACGCCC GCCATAAACT GCCAGGCATC 
2701 AAACTAAGCA GAAGGCCATC 



FIG.13B 
Cloning sites of the Entry Vector pENTR5

Features and restriction sites (left to right): Int, attL1, Nde I, Xmn I, Sal I, BamH I, Kpn I, EcoR I

ttg tac aaa aaa gca ggc ttt cat atg gga acc aat tca gtc gac tgg atc cgg tac cga att cgc
aac atg ttt ttt cgt ccg aaa gta tac cct tgg tta agt cag ctg acc tag gcc atg gct taa gcg
Leu Tyr Lys Lys Ala Gly Phe His Met Gly Thr Asn Ser Val Asp Trp Ile Arg Tyr Arg Ile

Death (ccdB)

Features and restriction sites (left to right): EcoR I, Not I, Xho I, EcoR V, Xba I, Int, attL2

aga att cgc ggc cgc act cga gat atc tag acc cag ctt tct tgt aca aag
tct taa gcg ccg gcg tga gct cta tag atc tgg gtc gaa aga aca tgt ttc

FIG.14A
pENTR5 2720 bp

Location (Base Nos.)    Gene Encoded
67..166                 attL1
324..629                ccdB
658..757                attL2
880..1689               KmR
1794..2367              ori

1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC 
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT 
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTTTCA TATGGGAACC 
181 AATTCAGTCG ACTGGATCCG GTACCGAATT CGCTTACTAA AAGCCAGATA ACAGTATGCG 
241 TATTTGCGCG CTGATTTTTG CGGTATAAGA ATATATACTG ATATGTATAC CCGAAGTATG 
301 TCAAAAAGAG GTGTGCTTCT AGAATGCAGT TTAAGGTTTA CACCTATAAA AGAGAGAGCC 
361 GTTATCGTCT GTTTGTGGAT GTACAGAGTG ATATTATTGA CACGCCCGGG CGACGGATGG 
421 TGATCCCCCT GGCCAGTGCA CGTCTGCTGT CAGATAAAGT CTCCCGTGAA CTTTACCCGG 
481 TGGTGCATAT CGGGGATGAA AGCTGGCGCA TGATGACCAC CGATATGGCC AGTGTGCCGG
541 TCTCCGTTAT CGGGGAAGAA GTGGCTGATC TCAGCCACCG CGAAAATGAC ATCAAAAACG 
601 CCATTAACCT GATGTTCTGG GGAATATAGA ATTCGCGGCC GCACTCGAGA TATCTAGACC 
661 CAGCTTTCTT GTACAAAGTT GGCATTATAA GAAAGCATTG CTTATCAATT TGTTGCAACG 
721 AACAGGTCAC TATCAGTCAA AATAAAATCA TTATTTGCCA TCCAGCTGCA GCTCTGGCCC 
781 GTGTCTCAAA ATCTCTGATG TTACATTGCA CAAGATAAAA ATATATCATC ATGAACAATA 
841 AAACTGTCTG CTTACATAAA CAGTAATACA AGGGGTGTTA TGAGCCATAT TCAACGGGAA 
901 ACGTCGAGGC CGCGATTAAA TTCCAACATG GATGCTGATT TATATGGGTA TAAATGGGCT 
961 CGCGATAATG TCGGGCAATC AGGTGCGACA ATCTATCGCT TGTATGGGAA GCCCGATGCG 
1021 CCAGAGTTGT TTCTGAAACA TGGCAAAGGT AGCGTTGCCA ATGATGTTAC AGATGAGATG 
1081 GTCAGACTAA ACTGGCTGAC GGAATTTATG CCTCTTCCGA CCATCAAGCA TTTTATCCGT 
1141 ACTCCTGATG ATGCATGGTT ACTCACCACT GCGATCCCCG GAAAAACAGC ATTCCAGGTA 
1201 TTAGAAGAAT ATCCTGATTC AGGTGAAAAT ATTGTTGATG CGCTGGCAGT GTTCCTGCGC 
1261 CGGTTGCATT CGATTCCTGT TTGTAATTGT CCTTTTAACA GCGATCGCGT ATTTCGTCTC 
1321 GCTCAGGCGC AATCACGAAT GAATAACGGT TTGGTTGATG CGAGTGATTT TGATGACGAG 
1381 CGTAATGGCT GGCCTGTTGA ACAAGTCTGG AAAGAAATGC ATAAACTTTT GCCATTCTCA
1441 CCGGATTCAG TCGTCACTCA TGGTGATTTC TCACTTGATA ACCTTATTTT TGACGAGGGG
1501 AAATTAATAG GTTGTATTGA TGTTGGACGA GTCGGAATCG CAGACCGATA CCAGGATCTT
1561 GCCATCCTAT GGAACTGCCT CGGTGAGTTT TCTCCTTCAT TACAGAAACG GCTTTTTCAA
1621 AAATATGGTA TTGATAATCC TGATATGAAT AAATTGCAGT TTCATTTGAT GCTCGATGAG
1681 TTTTTCTAAT CAGAATTGGT TAATTGGTTG TAACATTATT CAGATTGGGC CCCGTTCCAC
1741 TGAGCGTCAG ACCCCGTAGA AAAGATCAAA GGATCTTCTT GAGATCCTTT TTTTCTGCGC
1801 GTAATCTGCT GCTTGCAAAC AAAAAAACCA CCGCTACCAG CGGTGGTTTG TTTGCCGGAT
1861 CAAGAGCTAC CAACTCTTTT TCCGAAGGTA ACTGGCTTCA GCAGAGCGCA GATACCAAAT
1921 ACTGTTCTTC TAGTGTAGCC GTAGTTAGGC CACCACTTCA AGAACTCTGT AGCACCGCCT 
1981 ACATACCTCG CTCTGCTAAT CCTGTTACCA GTGGCTGCTG CCAGTGGCGA TAAGTCGTGT 
2041 CTTACCGGGT TGGACTCAAG ACGATAGTTA CCGGATAAGG CGCAGCGGTC GGGCTGAACG 
2101 GGGGGTTCGT GCACACAGCC CAGCTTGGAG CGAACGACCT ACACCGAACT GAGATACCTA 
2161 CAGCGTGAGC TATGAGAAAG CGCCACGCTT CCCGAAGGGA GAAAGGCGGA CAGGTATCCG 
2221 GTAAGCGGCA GGGTCGGAAC AGGAGAGCGC ACGAGGGAGC TTCCAGGGGG AAACGCCTGG 
2281 TATCTTTATA GTCCTGTCGG GTTTCGCCAC CTCTGACTTG AGCGTCGATT TTTGTGATGC
2341 TCGTCAGGGG GGCGGAGCCT ATGGAAAAAC GCCAGCAACG CGGCCTTTTT ACGGTTCCTG
2401 GCCTTTTGCT GGCCTTTTGC TCACATGTTC TTTCCTGCGT TATCCCCTGA TTCTGTGGAT
2461 AACCGTATTA CCGCTAGCAT GGATCTCGGG GACGTCTAAC TACTAAGCGA GAGTAGGGAA 
2521 CTGCCAGGCA TCGAATAAAA CGAAAGGCTC AGTCGGAAGA CTGGGCCTTT CGTTTTATCT 
2581 GTTGTTTGTC GGTGAACGCT CTCCTGAGTA GGACAAATCC GCCGGGAGCG GATTTGAACG 
2641 TTGTGAAGCA ACGGCCCGGA GGGTGGCGGG CAGGACGCCC GCCATAAACT GCCAGGCATC 
2701 AAACTAAGCA GAAGGCCATC 

FIG.14B 
Cloning sites of the Entry Vector pENTR6

Features and restriction sites (left to right): Int, attL1, Sph I, Xmn I, Sal I, BamH I, Kpn I, EcoR I

ttg tac aaa aaa gca ggc tgc atg cga acc aat tca gtc gac tgg atc cgg tac cga att cgc
aac atg ttt ttt cgt ccg acg tac gct tgg tta agt cag ctg acc tag gcc atg gct taa gcg
Leu Tyr Lys Lys Ala Gly Cys Met Arg Thr Asn Ser Val Asp Trp Ile Arg Tyr Arg Ile

Death (ccdB)

Restriction site: EcoR I

aga att cgc
tct taa gcg

FIG.15A
pENTR6 2717 bp

Location (Base Nos.)    Gene Encoded
67..166                 attL1
321..626                ccdB
655..754                attL2
877..1686               KmR
1791..2364              ori

1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT 
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTGCAT GCGAACCAAT 
181 TCAGTCGACT GGATCCGGTA CCGAATTCGC TTACTAAAAG CCAGATAACA GTATGCGTAT 
241 TTGCGCGCTG ATTTTTGCGG TATAAGAATA TATACTGATA TGTATACCCG AAGTATGTCA 
301 AAAAGAGGTG TGCTTCTAGA ATGCAGTTTA AGGTTTACAC CTATAAAAGA GAGAGCCGTT 
361 ATCGTCTGTT TGTGGATGTA CAGAGTGATA TTATTGACAC GCCCGGGCGA CGGATGGTGA 
421 TCCCCCTGGC CAGTGCACGT CTGCTGTCAG ATAAAGTCTC CCGTGAACTT TACCCGGTGG 
481 TGCATATCGG GGATGAAAGC TGGCGCATGA TGACCACCGA TATGGCCAGT GTGCCGGTCT 
541 CCGTTATCGG GGAAGAAGTG GCTGATCTCA GCCACCGCGA AAATGACATC AAAAACGCCA 
601 TTAACCTGAT GTTCTGGGGA ATATAGAATT CGCGGCCGCA CTCGAGATAT CTAGACCCAG 
661 CTTTCTTGTA CAAAGTTGGC ATTATAAGAA AGCATTGCTT ATCAATTTGT TGCAACGAAC 
721 AGGTCACTAT CAGTCAAAAT AAAATCATTA TTTGCCATCC AGCTGCAGCT CTGGCCCGTG 
781 TCTCAAAATC TCTGATGTTA CATTGCACAA GATAAAAATA TATCATCATG AACAATAAAA 
841 CTGTCTGCTT ACATAAACAG TAATACAAGG GGTGTTATGA GCCATATTCA ACGGGAAACG 
901 TCGAGGCCGC GATTAAATTC CAACATGGAT GCTGATTTAT ATGGGTATAA ATGGGCTCGC 
961 GATAATGTCG GGCAATCAGG TGCGACAATC TATCGCTTGT ATGGGAAGCC CGATGCGCCA 
1021 GAGTTGTTTC TGAAACATGG CAAAGGTAGC GTTGCCAATG ATGTTACAGA TGAGATGGTC 
1081 AGACTAAACT GGCTGACGGA ATTTATGCCT CTTCCGACCA TCAAGCATTT TATCCGTACT 
1141 CCTGATGATG CATGGTTACT CACCACTGCG ATCCCCGGAA AAACAGCATT CCAGGTATTA 
1201 GAAGAATATC CTGATTCAGG TGAAAATATT GTTGATGCGC TGGCAGTGTT CCTGCGCCGG 
1261 TTGCATTCGA TTCCTGTTTG TAATTGTCCT TTTAACAGCG ATCGCGTATT TCGTCTCGCT 
1321 CAGGCGCAAT CACGAATGAA TAACGGTTTG GTTGATGCGA GTGATTTTGA TGACGAGCGT 
1381 AATGGCTGGC CTGTTGAACA AGTCTGGAAA GAAATGCATA AACTTTTGCC ATTCTCACCG
1441 GATTCAGTCG TCACTCATGG TGATTTCTCA CTTGATAACC TTATTTTTGA CGAGGGGAAA
1501 TTAATAGGTT GTATTGATGT TGGACGAGTC GGAATCGCAG ACCGATACCA GGATCTTGCC
1561 ATCCTATGGA ACTGCCTCGG TGAGTTTTCT CCTTCATTAC AGAAACGGCT TTTTCAAAAA
1621 TATGGTATTG ATAATCCTGA TATGAATAAA TTGCAGTTTC ATTTGATGCT CGATGAGTTT
1681 TTCTAATCAG AATTGGTTAA TTGGTTGTAA CATTATTCAG ATTGGGCCCC GTTCCACTGA
1741 GCGTCAGACC CCGTAGAAAA GATCAAAGGA TCTTCTTGAG ATCCTTTTTT TCTGCGCGTA
1801 ATCTGCTGCT TGCAAACAAA AAAACCACCG CTACCAGCGG TGGTTTGTTT GCCGGATCAA 
1861 GAGCTACCAA CTCTTTTTCC GAAGGTAACT GGCTTCAGCA GAGCGCAGAT ACCAAATACT 
1921 GTTCTTCTAG TGTAGCCGTA GTTAGGCCAC CACTTCAAGA ACTCTGTAGC ACCGCCTACA 
1981 TACCTCGCTC TGCTAATCCT GTTACCAGTG GCTGCTGCCA GTGGCGATAA GTCGTGTCTT 
2041 ACCGGGTTGG ACTCAAGACG ATAGTTACCG GATAAGGCGC AGCGGTCGGG CTGAACGGGG 
2101 GGTTCGTGCA CACAGCCCAG CTTGGAGCGA ACGACCTACA CCGAACTGAG ATACCTACAG 
2161 CGTGAGCTAT GAGAAAGCGC CACGCTTCCC GAAGGGAGAA AGGCGGACAG GTATCCGGTA 
2221 AGCGGCAGGG TCGGAACAGG AGAGCGCACG AGGGAGCTTC CAGGGGGAAA CGCCTGGTAT
2281 CTTTATAGTC CTGTCGGGTT TCGCCACCTC TGACTTGAGC GTCGATTTTT GTGATGCTCG
2341 TCAGGGGGGC GGAGCCTATG GAAAAACGCC AGCAACGCGG CCTTTTTACG GTTCCTGGCC
2401 TTTTGCTGGC CTTTTGCTCA CATGTTCTTT CCTGCGTTAT CCCCTGATTC TGTGGATAAC 
2461 CGTATTACCG CTAGCATGGA TCTCGGGGAC GTCTAACTAC TAAGCGAGAG TAGGGAACTG 
2521 CCAGGCATCA AATAAAACGA AAGGCTCAGT CGGAAGACTG GGCCTTTCGT TTTATCTGTT 
2581 GTTTGTCGGT GAACGCTCTC CTGAGTAGGA CAAATCCGCC GGGAGCGGAT TTGAACGTTG 
2641 TGAAGCAACG GCCCGGAGGG TGGCGGGCAG GACGCCCGCC ATAAACTGCC AGGCATCAAA 
2701 CTAAGCAGAA GGCCATC 

FIG.15B 
Cloning sites of the Entry Vector pENTR7

Features (left to right): attL1, TEV protease site (cleavage between Gln and Gly)

ttg tac aaa aaa gca ggc ttt gaa aac ctg tat ttt caa gga
aac atg ttt ttt cgt ccg aaa ctt ttg gac ata aaa gtt cct
Leu Tyr Lys Lys Ala Gly Phe Glu Asn Leu Tyr Phe Gln^Gly

Restriction sites (left to right): Xmn I, Sal I, BamH I, Kpn I, EcoR I

acc gtt tca tgc atc gtc gac tgg atc cgg tac cga att cgc
tgg caa agt acg tag cag ctg acc tag gcc atg gct taa gcg
Thr Val Ser Cys Ile Val Asp Trp Ile Arg Tyr Arg Ile

Death (ccdB)

Features and restriction sites (left to right): EcoR I, Not I, Xho I, EcoR V, Xba I, Int, attL2

aga att cgc ggc cgc act cga gat atc tag acc cag ctt tct tgt aca aag
tct taa gcg ccg gcg tga gct cta tag atc tgg gtc gaa aga aca tgt ttc

FIG.16A
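
FIG.16A marks a TEV protease site with the cut drawn between Gln and Gly. As a small illustration, the sketch below splits the translated linker peptide at that position; `split_at_tev` is a hypothetical helper, the motif ENLYFQG is read off the figure's amino acid line, and the peptide is that line written in one-letter code.

```python
def split_at_tev(peptide, motif="ENLYFQG"):
    """Split a peptide between the Gln and Gly of the first TEV motif, or return None."""
    index = peptide.find(motif)
    if index < 0:
        return None
    cut = index + len(motif) - 1  # position of the Gly that follows the cut
    return peptide[:cut], peptide[cut:]


# Leu Tyr Lys Lys Ala Gly Phe Glu Asn Leu Tyr Phe Gln Gly Thr Val Ser Cys Ile Val
# Asp Trp Ile Arg Tyr Arg Ile, in one-letter code.
linker_peptide = "LYKKAGFENLYFQGTVSCIVDWIRYRI"

fragments = split_at_tev(linker_peptide)
print(fragments)
```

The second fragment begins with the Gly that the figure places immediately after the cleavage mark.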
pENTR7 2738 bp

Location (Base Nos.)    Gene Encoded
67..166                 attL1
342..647                ccdB
676..775                attL2
898..1707               KmR
1812..2385              ori

1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT 
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTTTGA AAACCTGTAT 
181 TTTCAAGGAA CCGTTTCATG CATCGTCGAC TGGATCCGGT ACCGAATTCG CTTACTAAAA
241 GCCAGATAAC AGTATGCGTA TTTGCGCGCT GATTTTTGCG GTATAAGAAT ATATACTGAT 
301 ATGTATACCC GAAGTATGTC AAAAAGAGGT GTGCTTCTAG AATGCAGTTT AAGGTTTACA 
361 CCTATAAAAG AGAGAGCCGT TATCGTCTGT TTGTGGATGT ACAGAGTGAT ATTATTGACA 
421 CGCCCGGGCG ACGGATAGTG ATCCCCCTGG CCAGTGCACG TCTGCTGTCA GATAAAGTCT 
481 CCCGTGAACT TTACCCGGTG GTGCATATCG GGGATGAAAG CTGGCGCATG ATGACCACCG 
541 ATATGGCCAG TGTGCCGGTC TCCGTTATCG GGGAAGAAGT GGCTGATCTC AGCCACCGCG 
601 AAAATGACAT CAAAAACGCC ATTAACCTGA TGTTCTGGGG AATATAGAAT TCGCGGCCGC 
661 ACTCGAGATA TCTAGACCCA GCTTTCTTGT ACAAAGTTGG CATTATAAGA AAGCATTGCT 
721 TATCAATTTG TTGCAACGAA CAGGTCACTA TCAGTCAAAA TAAAATCATT ATTTGCCATC 
781 CAGCTGCAGC TCTGGCCCGT GTCTCAAAAT CTCTGATGTT ACATTGCACA AGATAAAAAT 
841 ATATCATCAT GAACAATAAA ACTGTCTGCT TACATAAACA GTAATACAAG GGGTGTTATG 
901 AGCCATATTC AACGGGAAAC GTCGAGGCCG CGATTAAATT CCAACATGGA TGCTGATTTA 
961 TATGGGTATA AATGGGCTCG CGATAATGTC GGGCAATCAG GTGCGACAAT CTATCGCTTG 
1021 TATGGGAAGC CCGATGCGCC AGAGTTGTTT CTGAAACATG GCAAAGGTAG CGTTGCCAAT 
1081 GATGTTACAG ATGAGATGGT CAGACTAAAC TGGCTGACGG AATTTATGCC TCTTCCGACC 
1141 ATCAAGCATT TTATCCGTAC TCCTGATGAT GCATGGTTAC TCACCACTGC GATCCCCGGA 
1201 AAAACAGCAT TCCAGGTATT AGAAGAATAT CCTGATTCAG GTGAAAATAT TGTTGATGCG
1261 CTGGCAGTGT TCCTGCGCCG GTTGCATTCG ATTCCTGTTT GTAATTGTCC TTTTAACAGC 
1321 GATCGCGTAT TTCGTCTCGC TCAGGCGCAA TCACGAATGA ATAACGGTTT GGTTGATGCG 
1381 AGTGATTTTG ATGACGAGGG TAATGGCTGG CCTGTTGAAC AAGTCTGGAA AGAAATGCAT 
1441 AAACTTTTGC CATTCTCACC GGATTCAGTC GTCACTCATG GTGATTTCTC ACTTGATAAC
1501 CTTATTTTTG ACGAGGGGAA ATTAATAGGT TGTATTGATG TTGGACGAGT CGGAATCGCA
1561 GACCGATACC AGGATCTTGC CATCCTATGG AACTGCCTCG GTGAGTTTTC TCCTTCATTA
1621 CAGAAACGGC TTTTTCAAAA ATATGGTATT GATAATCCTG ATATGAATAA ATTGCAGTTT
1681 CATTTGATGC TCGATGAGTT TTTCTAATCA GAATTGGTTA ATTGGTTGTA ACATTATTCA
1741 GATTGGGCCC CGTTCCACTG AGCGTCAGAC CCCGTAGAAA AGATCAAAGG ATCTTCTTGA
1801 GATCCTTTTT TTCTGCGCGT AATCTGCTGC TTGCAAACAA AAAAACCACC GCTACCAGCG
1861 GTGGTTTGTT TGCCGGATCA AGAGCTACCA ACTCTTTTTC CGAAGGTAAC TGGCTTCAGC 
1921 AGAGCGCAGA TACCAAATAC TGTTCTTCTA GTGTAGCCGT AGTTAGGCCA CCACTTCAAG 
1981 AACTCTGTAG CACCGCCTAC ATACCTCGCT CTGCTAATCC TGTTACCAGT GGCTGCTGCC 
2041 AGTGGCGATA AGTCGTGTCT TACCGGGTTG GACTCAAGAC GATAGTTACC GGATAAGGCG 
2101 CAGCGGTCGG GCTGAACGGG GGGTTCGTGC ACACAGCCCA GCTTGGAGCG AACGACCTAC 
2161 ACCGAACTGA GATACCTACA GCGTGAGCTA TGAGAAAGCG CCACGCTTCC CGAAGGGAGA 
2221 AAGGCGGACA GGTATCCGGT AAGCGGCAGG GTCGGAACAG GAGAGCGCAC GAGGGAGCTT 
2281 CCAGGGGGAA ACGCCTGGTA TCTTTATAGT CCTGTCGGGT TTCGCCACCT CTGACTTGAG
2341 CGTCGATTTT TGTGATGCTC GTCAGGGGGG CGGAGCCTAT GGAAAAACGC CAGCAACGCG
2401 GCCTTTTTAC GGTTCCTGGC CTTTTGCTGG CCTTTTGCTC ACATGTTCTT TCCTGCGTTA 
2461 TCCCCTGATT CTGTGGATAA CCGTATTACC GCTAGCATGG ATCTCGGGGA CGTCTAACTA 
2521 CTAAGCGAGA GTAGGGAACT GCCAGGCATC AAATAAAACG AAAGGCTCAG TCGGAAGACT
2581 GGGCCTTTCG TTTTATCTGT TGTTTGTCGG TGAACGCTCT CCTGAGTAGG ACAAATCCGC 
2641 CGGGAGCGGA TTTGAACGTT GTGAAGCAAC GGCCCGGAGG GTGGCGGGCA GGACGCCCGC 
2701 CATAAACTGC CAGGCATCAA ACTAAGCAGA AGGCCATC 

FIG.16B 
CLONING SITES OF THE ENTRY VECTOR pENTR8

Features (left to right): Int, attL1, TEV protease site (cleavage between Gln and Gly)

ttg tac aaa aaa gca ggc ttt gaa aac ctg tat ttt caa gga
aac atg ttt ttt cgt ccg aaa ctt ttg gac ata aaa gtt cct
Leu Tyr Lys Lys Ala Gly Phe Glu Asn Leu Tyr Phe Gln^Gly

Restriction sites (left to right): Nco I, Ava II, Sal I, BamH I, Kpn I, EcoR I

acc atg gac cta gtc gac tgg atc cgg tac cga att cgc
tgg tac ctg gat cag ctg acc tag gcc atg gct taa gcg
Thr Met Asp Leu Val Asp Trp Ile Arg Tyr Arg Ile

Death (ccdB)

Features and restriction sites (left to right): EcoR I, Not I, Xho I, EcoR V, Xba I, attL2, Int

aga att cgc ggc cgc act cga gat atc tag acc cag ctt tct tgt aca aag
tct taa gcg ccg gcg tga gct cta tag atc tgg gtc gaa aga aca tgt ttc

FIG.17A
pENTR8 2735 bp

Location (Base Nos.)    Gene Encoded
67..166                 attL1
339..644                ccdB
673..772                attL2
895..1704               KmR
1809..2382              ori
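
The eight feature tables can be cross-checked against one another: the vectors differ only in the linker between attL1 and ccdB, so a longer linker should move ccdB downstream by the same number of bases. The sketch below is an illustration under that assumption; the dictionary `VECTORS` is a hypothetical name, and its values (total length, first base of ccdB) are copied from the tables.

```python
# (total length in bp, first base of ccdB) for each Entry Vector, from the tables.
VECTORS = {
    "pENTR1A": (2717, 321),
    "pENTR2B": (2718, 322),
    "pENTR3C": (2723, 327),
    "pENTR4": (2720, 324),
    "pENTR5": (2720, 324),
    "pENTR6": (2717, 321),
    "pENTR7": (2738, 342),
    "pENTR8": (2735, 339),
}

ref_length, ref_ccdb = VECTORS["pENTR1A"]

# For each vector: (extra length, ccdB shift), both relative to pENTR1A.
shifts = {
    name: (length - ref_length, ccdb_start - ref_ccdb)
    for name, (length, ccdb_start) in VECTORS.items()
}
consistent = all(extra == shift for extra, shift in shifts.values())

for name, (extra, shift) in shifts.items():
    print(f"{name:8s} +{extra:2d} bp total, ccdB shifted by {shift:2d}")
print("consistent:", consistent)
```

A mismatch between the two numbers for any vector would point to a transcription error in either the length or the coordinates.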

1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTTTGA AAACCTGTAT
181 TTTCAAGGAA CCATGGACCT AGTCGACTGG ATCCGGTACC GAATTCGCTT ACTAAAAGCC
241 AGATAACAGT ATGCGTATTT GCGCGCTGAT TTTTGCGGTA TAAGAATATA TACTGATATG 
301 TATACCCGAA GTATGTCAAA AAGAGGTGTG CTTCTAGAAT GCAGTTTAAG GTTTACACCT 
361 ATAAAAGAGA GAGCCGTTAT CGTCTGTTTG TGGATGTACA GAGTGATATT ATTGACACGC 
421 CCGGGCGACG GATAGTGATC CCCCTGGCCA GTGCACGTCT GCTGTCAGAT AAAGTCTCCC 
481 GTGAACTTTA CCCGGTGGTG CATATCGGGG ATGAAAGCTG GCGCATGATG ACCACCGATA 
541 TGGCCAGTGT GCCGGTCTCC GTTATCGGGG AAGAAGTGGC TGATCTCAGC CACCGCGAAA 
601 ATGACATCAA AAACGCCATT AACCTGATGT TCTGGGGAAT ATAGAATTCG CGGCCGCACT 
661 CGAGATATCT AGACCCAGCT TTCTTGTACA AAGTTGGCAT TATAAGAAAG CATTGCTTAT 
721 CAATTTGTTG CAACGAACAG GTCACTATCA GTCAAAATAA AATCATTATT TGCCATCCAG 
781 CTGCAGCTCT GGCCCGTGTC TCAAAATCTC TGATGTTACA TTGCACAAGA TAAAAATATA 
841 TCATCATGAA CAATAAAACT GTCTGCTTAC ATAAACAGTA ATACAAGGGG TGTTATGAGC 
901 CATATTCAAC GGGAAACGTC GAGGCCGCGA TTAAATTCCA ACATGGATGC TGATTTATAT 
961 GGGTATAAAT GGGCTCGCGA TAATGTCGGG CAATCAGGTG CGACAATCTA TCGCTTGTAT 
1021 GGGAAGCCCG ATGCGCCAGA GTTGTTTCTG AAACATGGCA AAGGTAGCGT TGCCAATGAT 
1081 GTTACAGATG AGATGGTCAG ACTAAACTGG CTGACGGAAT TTATGCCTCT TCCGACCATC 
1141 AAGCATTTTA TCCGTACTCC TGATGATGCA TGGTTACTCA CCACTGCGAT CCCCGGAAAA 
1201 ACAGCATTCC AGGTATTAGA AGAATATCCT GATTCAGGTG AAAATATTGT TGATGCGCTG 
1261 GCAGTGTCCC TGCGCCGGTT GCATTCGATT CCTGTTTGTA ATTGTCCTTT TAACAGCGAT 
1321 CGCGTATTTC GTCTCGCTCA GGCGCAATCA CGAATGAATA ACGGTTTGGT TGATGCGAGT 
1381 GATTTTGATG ACGAGCGTAA TGGCTGGCCT GTTGAACAAG TCTGGAAAGA AATGCATAAA 
1441 CTTTTGCCAT TCTCACCGGA TTCAGTCGTC ACTCATGGTG ATTTCTCACT TGATAACCTT 
1501 ATTTTTGACG AGGGGAAATT AATAGGTTGT ATTGATGTTG GACGAGTCGG AATCGCAGAC 
1561 CGATACCAGG ATCTTGCCAT CCTATGGAAC TGCCTCGGTG AGTTTTCTCC TTCATTACAG 
1621 AAACGGCTTT TTCAAAAATA TGGTATTGAT AATCCTGATA TGAATAAATT GCAGTTTCAT 
1681 TTGATGCTCG ATGAGTTTTT CTAATCAGAA TTGGTTAATT GGTTGTAACA TTATTCAGAT 
1741 TGGGCCCCGT TCCACTGAGC GTCAGACCCC GTAGAAAAGA TCAAAGGATC TTCTTGAGAT 
1801 CCTTTTTTTC TGCGCGTAAT CTGCTGCTTG CAAACAAAAA AACCACCGCT ACCAGCGGTG 
1861 GTTTGTTTGC CGGATCAAGA GCTACCAACT CTTTTTCCGA AGGTAACTGG CTTCAGCAGA 
1921 GCGCAGATAC CAAATACTGT TCTTCTAGTG TAGCCGTAGT TAGGCCACCA CTTCAAGAAC 
1981 TCTGTAGCAC CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC TGCTGCCAGT 
2041 GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT AGTTACCGGA TAAGGCGCAG 
2101 CGGTCGGGCT GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC GACCTACACC 
2161 GAACTGAGAT ACCTACAGCG TGAGCTATGA GAAAGCGCCA CGCTTCCCGA AGGGAGAAAG 
2221 GCGGACAGGT ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG GGAGCTTCCA 
2281 GGGGGAAACG CCTGGTATCT TTATAGTCCT GTCGGGTTTC GCCACCTCTG ACTTGAGCGT 
2341 CGATTTTTGT GATGCTCGTC AGGGGGGCGG AGCCTATGGA AAAACGCCAG CAACGCGGCC 
2401 TTTTTACGGT TCCTGGCCTT TTGCTGGCCT TTTGCTCACA TGTTCTTTCC TGCGTTATCC 
2461 CCTGATTCTG TGGATAACCG TATTACCGCT AGCATGGATC TCGGGGACGT CTAACTACTA 
2521 AGCGAGAGTA GGGAACTGCC AGGCATCAAA TAAAACGAAA GGCTCAGTCG GAAGACTGGG 
2581 CCTTTCGTTT TATCTGTTGT TTGTCGGTGA ACGCTCTCCT GAGTAGGACA AATCCGCCGG 
2641 GAGCGGATTT GAACGTTGTG AAGCAACGGC CCGGAGGGTG GCGGGCAGGA CGCCCGCCAT 
2701 AAACTGCCAG GCATCAAACT AAGCAGAAGG CCATC 
CLONING SITES OF THE ENTRY VECTOR pENTR9 

Int attL1 
    tac aaa aaa gca ggc ttt gaa aac ctg tat ttt caa gga 
            ttt cgt ccg aaa ctt ttg gac ata aaa gtt cct 
Leu Tyr Lys Lys Ala Gly Phe Glu Asn Leu Tyr Phe Gln^Gly 
                            TEV Protease 

NdeI  BglII  SalI  BamHI  KpnI  EcoRI 
ca|t atg a|ga tct g|tc gac tg|g atc cgg tac|cg|a att cgc --- 
gta t|ac tct ag|a cag ct|g acc tag|gc|c atg gct taa|gcg --- 
His Met Arg Ser Val Asp Trp Ile Arg Tyr Arg Ile 

          EcoRI  NotI  XhoI  EcoRV  XbaI  attL2 
Death --- ag|a att cgc|ggc cgc ac|t cga gat at|c tag acc cag 
      --- tct taa|gcg ccg g|cg tga gct|cta tag atc|tgg gtc 

pENTR9 2735 bp 

Location (Base Nos.)    Gene Encoded 
67..166                 attL1 
339..644                ccdB 
673..772                attL2 
895..1704               KmR 
1809..2382              ori 

1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC 
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT 
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTTTGA AAACCTGTAT 
181 TTTCAAGGAC ATATGAGATC TGTCGACTGG ATCCGGTACC GAATTCGCTT ACTAAAAGCC 
241 AGATAACAGT ATGCGTATTT GCGCGCTGAT TTTTGCGGTA TAAGAATATA TACTGATATG 
301 TATACCCGAA GTATGTCAAA AAGAGGTGTG CTTCTAGAAT GCAGTTTAAG GTTTACACCT 
361 ATAAAAGAGA GAGCCGTTAT CGTCTGTTTG TGGATGTACA GAGTGATATT ATTGACACGC 
421 CCGGGCGACG GATAGTGATC CCCCTGGCCA GTGCACGTCT GCTGTCAGAT AAAGTCTCCC 
481 GTGAACTTTA CCCGGTGGTG CATATCGGGG ATGAAAGCTG GCGCATGATG ACCACCGATA 
541 TGGCCAGTGT GCCGGTCTCC GTTATCGGGG AAGAAGTGGC TGATCTCAGC CACCGCGAAA 
601 ATGACATCAA AAACGCCATT AACCTGATGT TCTGGGGAAT ATAGAATTCG CGGCCGCACT 
661 CGAGATATCT AGACCCAGCT TTCTTGTACA AAGTTGGCAT TATAAGAAAG CATTGCTTAT 
721 CAATTTGTTG CAACGAACAG GTCACTATCA GTCAAAATAA AATCATTATT TGCCATCCAG 
781 CTGCAGCTCT GGCCCGTGTC TCAAAATCTC TGATGTTACA TTGCACAAGA TAAAAATATA 
841 TCATCATGAA CAATAAAACT GTCTGCTTAC ATAAACAGTA ATACAAGGGG TGTTATGAGC 
901 CATATTCAAC GGGAAACGTC GAGGCCGCGA TTAAATTCCA ACATGGATGC TGATTTATAT 
961 GGGTATAAAT GGGCTCGCGA TAATGTCGGG CAATCAGGTG CGACAATCTA TCGCTTGTAT 
1021 GGGAAGCCCG ATGCGCCAGA GTTGTTTCTG AAACATGGCA AAGGTAGCGT TGCCAATGAT 
1081 GTTACAGATG AGATGGTCAG ACTAAACTGG CTGACGGAAT TTATGCCTCT TCCGACCATC 
1141 AAGCATTTTA TCCGTACTCC TGATGATGCA TGGTTACTCA CCACTGCGAT CCCCGGAAAA 
1201 ACAGCATTCC AGGTATTAGA AGAATATCCT GATTCAGGTG AAAATATTGT TGATGCGCTG 
1261 GCAGTGTCCC TGCGCCGGTT GCATTCGATT CCTGTTTGTA ATTGTCCTTT TAACAGCGAT 
1321 CGCGTATTTC GTCTCGCTCA GGCGCAATCA CGAATGAATA ACGGTTTGGT TGATGCGAGT 
1381 GATTTTGATG ACGAGCGTAA TGGCTGGCCT GTTGAACAAG TCTGGAAAGA AATGCATAAA 
1441 CTTTTGCCAT TCTCACCGGA TTCAGTCGTC ACTCATGGTG ATTTCTCACT TGATAACCTT 
1501 ATTTTTGACG AGGGGAAATT AATAGGTTGT ATTGATGTTG GACGAGTCGG AATCGCAGAC 
1561 CGATACCAGG ATCTTGCCAT CCTATGGAAC TGCCTCGGTG AGTTTTCTCC TTCATTACAG 
1621 AAACGGCTTT TTCAAAAATA TGGTATTGAT AATCCTGATA TGAATAAATT GCAGTTTCAT 
1681 TTGATGCTCG ATGAGTTTTT CTAATCAGAA TTGGTTAATT GGTTGTAACA TTATTCAGAT 
1741 TGGGCCCCGT TCCACTGAGC GTCAGACCCC GTAGAAAAGA TCAAAGGATC TTCTTGAGAT 
1801 CCTTTTTTTC TGCGCGTAAT CTGCTGCTTG CAAACAAAAA AACCACCGCT ACCAGCGGTG 
1861 GTTTGTTTGC CGGATCAAGA GCTACCAACT CTTTTTCCGA AGGTAACTGG CTTCAGCAGA 
1921 GCGCAGATAC CAAATACTGT TCTTCTAGTG TAGCCGTAGT TAGGCCACCA CTTCAAGAAC 
1981 TCTGTAGCAC CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC TGCTGCCAGT 
2041 GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT AGTTACCGGA TAAGGCGCAG 
2101 CGGTCGGGCT GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC GACCTACACC 
2161 GAACTGAGAT ACCTACAGCG TGAGCTATGA GAAAGCGCCA CGCTTCCCGA AGGGAGAAAG 
2221 GCGGACAGGT ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG GGAGCTTCCA 
2281 GGGGGAAACG CCTGGTATCT TTATAGTCGT GTCGGGTTTC GCCACCTCTG ACTTGAGCGT 
2341 CGATTTTTGT GATGCTCGTC AGGGGGGCGG AGCCTATGGA AAAACGCCAG CAACGCGGCC 
2401 TTTTTACGGT TCCTGGCCTT TTGCTGGCCT TTTGCTCACA TGTTCTTTCC TGCGTTATCC 
2461 CCTGATTCTG TGGATAACCG TATTACCGCT AGCATGGATC TCGGGGACGT CTAACTACTA 
2521 AGCGAGAGTA GGGAACTGCC AGGCATCAAA TAAAACGAAA GGCTCAGTCG GAAGACTGGG 
2581 CCTTTCGTTT TATCTGTTGT TTGTCGGTGA ACGCTCTCCT GAGTAGGACA AATCCGCCGG 
2641 GAGCGGATTT GAACGTTGTG AAGCAACGGC CCGGAGGGTG GCGGGCAGGA CGCCCGCCAT 
2701 AAACTGCCAG GCATCAAACT AAGCAGAAGG CCATC 

FIG.18B 
CLONING SITES OF THE ENTRY VECTOR pENTR10 

Int attL1                               S.D.          NdeI 
ttg tac aaa aaa gca ggc ttc gaa cta agg aaa tac tta ca|t 
        ttt cgt ccg aag ctt gat tcc ttt atg aat gta 
Leu Tyr Lys Lys Ala Gly Phe Glu Leu Arg Lys Tyr Leu His 

          XmnI  SalI  BamHI  KpnI  EcoRI 
atg gga acc aat tca g|tc gac tg|g atc cgg tac|cg|a att cgc 
tac cct tgg tta agt cag ct|g acc tag|gc|c atg gct taa|gcg 
Met Gly Thr Asn Ser Val Asp Trp Ile Arg Tyr Arg Ile 

           EcoRI  NotI  XhoI  EcoRV  XbaI  attL2  Int 
Death  --- ag|a att cgc|ggc cgc ac|t cga gat at|c tag acc cag ctt tct tgt ac 
(ccdB) --- tct taa|gcg ccg g|cg tga gct|cta tag atc tgg gtc gaa aga aca tg 

FIG.19A 
pENTR10 2738 bp 

Location (Base Nos.)    Gene Encoded 
67..166                 attL1 
342..647                ccdB 
676..775                attL2 
898..1707               KmR 
1812..2385              ori 



1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC 
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT 
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTTCGA ACTAAGGAAA 
181 TACTTACATA TGGGAACCAA TTCAGTCGAC TGGATCCGGT ACCGAATTCG CTTACTAAAA 
241 GCCAGATAAC AGTATGCGTA TTTGCGCGCT GATTTTTGCG GTATAAGAAT ATATACTGAT 
301 ATGTATACCC GAAGTATGTC AAAAAGAGGT GTGCTTCTAG AATGCAGTTT AAGGTTTACA 
361 CCTATAAAAG AGAGAGCCGT TATCGTCTGT TTGTGGATGT ACAGAGTGAT ATTATTGACA 
421 CGCCCGGGCG ACGGATGGTG ATCCCCCTGG CCAGTGCACG TCTGCTGTCA GATAAAGTCT 
481 CCCGTGAACT TTACCCGGTG GTGCATATCG GGGATGAAAG CTGGCGCATG ATGACCACCG 
541 ATATGGCCAG TGTGCCGGTC TCCGTTATCG GGGAAGAAGT GGCTGATCTC AGCCACCGCG 
601 AAAATGACAT CAAAAACGCC ATTAACCTGA TGTTCTGGGG AATATAGAAT TCGCGGCCGC 
661 ACTCGAGATA TCTAGACCCA GCTTTCTTGT ACAAAGTTGG CATTATAAGA AAGCATTGCT 
721 TATCAATTTG TTGCAACGAA CAGGTCACTA TCAGTCAAAA TAAAATCATT ATTTGCCATC 
781 CAGCTGCAGC TCTGGCCCGT GTCTCAAAAT CTCTGATGTT ACATTGCACA AGATAAAAAT 
841 ATATCATCAT GAACAATAAA ACTGTCTGCT TACATAAACA GTAATACAAG GGGTGTTATG 
901 AGCCATATTC AACGGGAAAC GTCGAGGCCG CGATTAAATT CCAACATGGA TGCTGATTTA 
961 TATGGGTATA AATGGGCTCG CGATAATGTC GGGCAATCAG GTGCGACAAT CTATCGCTTG 
1021 TATGGGAAGC CCGATGCGCC AGAGTTGTTT CTGAAACATG GCAAAGGTAG CGTTGCCAAT 
1081 GATGTTACAG ATGAGATGGT CAGACTAAAC TGGCTGACGG AATTTATGCC TCTTCCGACC 
1141 ATCAAGCATT TTATCCGTAC TCCTGATGAT GCATGGTTAC TCACCACTGC GATCCCCGGA 
1201 AAAACAGCAT TCCAGGTATT AGAAGAATAT CCTGATTCAG GTGAAAATAT TGTTGATGCG 
1261 CTGGCAGTGT TCCTGCGCCG GTTGCATTCG ATTCCTGTTT GTAATTGTCC TTTTAACAGC 
1321 GATCGCGTAT TTCGTCTCGC TCAGGCGCAA TCACGAATGA ATAACGGTTT GGTTGATGCG 
1381 AGTGATTTTG ATGACGAGCG TAATGGCTGG CCTGTTGAAC AAGTCTGGAA AGAAATGCAT 
1441 AAACTTTTGC CATTCTCACC GGATTCAGTC GTCACTCATG GTGATTTCTC ACTTGATAAC 
1501 CTTATTTTTG ACGAGGGGAA ATTAATAGGT TGTATTGATG TTGGACGAGT CGGAATCGCA 
1561 GACCGATACC AGGATCTTGC CATCCTATGG AACTGCCTCG GTGAGTTTTC TCCTTCATTA 
1621 CAGAAACGGC TTTTTCAAAA ATATGGTATT GATAATCCTG ATATGAATAA ATTGCAGTTT 
1681 CATTTGATGC TCGATGAGTT TTTCTAATCA GAATTGGTTA ATTGGTTGTA ACATTATTCA 
1741 GATTGGGCCC CGTTCCACTG AGCGTCAGAC CCCGTAGAAA AGATCAAAGG ATCTTCTTGA 
1801 GATCCTTTTT TTCTGCGCGT AATCTGCTGC TTGCAAACAA AAAAACCACC GCTACCAGCG 
1861 GTGGTTTGTT TGCCGGATCA AGAGCTACCA ACTCTTTTTC CGAAGGTAAC TGGCTTCAGC 
1921 AGAGCGCAGA TACCAAATAC TGTTCTTCTA GTGTAGCCGT AGTTAGGCCA CCACTTCAAG 
1981 AACTCTGTAG CACCGCCTAC ATACCTCGCT CTGCTAATCC TGTTACCAGT GGCTGCTGCC 
2041 AGTGGCGATA AGTCGTGTCT TACCGGGTTG GACTCAAGAC GATAGTTACC GGATAAGGCG 
2101 CAGCGGTCGG GCTGAACGGG GGGTTCGTGC ACACAGCCCA GCTTGGAGCG AACGACCTAC 
2161 ACCGAACTGA GATACCTACA GCGTGAGCTA TGAGAAAGCG CCACGCTTCC CGAAGGGAGA 
2221 AAGGCGGACA GGTATCCGGT AAGCGGCAGG GTCGGAACAG GAGAGCGCAC GAGGGAGCTT 
2281 CCAGGGGGAA ACGCCTGGTA TCTTTATAGT CCTGTCGGGT TTCGCCACCT CTGACTTGAG 
2341 CGTCGATTTT TGTGATGCTC GTCAGGGGGG CGGAGCCTAT GGAAAAACGC CAGCAACGCG 
2401 GCCTTTTTAC GGTTCCTGGC CTTTTGCTGG CCTTTTGCTC ACATGTTCTT TCCTGCGTTA 
2461 TCCCCTGATT CTGTGGATAA CCGTATTACC GCTAGCATGG ATCTCGGGGA CGTCTAACTA 
2521 CTAAGCGAGA GTAGGGAACT GCCAGGCATC GAATAAAACG AAAGGCTCAG TCGGAAGACT 
2581 GGGCCTTTCG TTTTATCTGT TGTTTGTCGG TGAACGCTCT CCTGAGTAGG ACAAATCCGC 
2641 CGGGAGCGGA TTTGAACGTT GTGAAGCAAC GGCCCGGAGG GTGGCGGGCA GGACGCCCGC 
2701 CATAAACTGC CAGGCATCAA ACTAAGCAGA AGGCCATC 



FIG.19B 






CLONING SITES OF THE ENTRY VECTOR pENTR11 

Int attL1                      S.D.          Kozak  XmnI          S.D. 
TT|G TAC AAA AAA GCA GGC TTC GAA GGA GAT AGA ACC AAT TCT CTA AGG AAA TAC 
AAC ATG TTT|TTT CGT CCG AAG CTT CCT CTA TCT TGG TTA AGA GAT TCC TTT ATG 
Leu Tyr Lys Lys Ala Gly Phe Glu Gly Asp Arg Thr Asn Ser Leu Arg Lys Tyr 

Kozak  NcoI  SalI  BamHI  KpnI  EcoRI 
TTA AC|C ATG G|TC GAC TG|G ATC CGG TAC|CG|A ATT C --- 
AAT TGG TAC|CAG CT|G ACC TAG|GC|C ATG GCT TAA|G 
Leu Thr Met Val Asp Trp Ile Arg Tyr Arg Ile 

             EcoRI  NotI 
--- ccdB --- G|AAT TCG C|GG CCG 
             C TTA A|GC GCC GG|C 
             Asn Ser Arg Pro 

XhoI  EcoRV  XbaI  Int attL2 
CAC|TCG AGA T|AT|CTA GAC CCA GCT TT|C TTG TAC AAA G 
GTG AGC T|CT ATA GAT C|TG GGT CGA AAG AAC ATG|TTT C 
His Ser Arg Tyr Leu Asp Pro Ala Phe Leu Tyr Lys 

FIG.20A 
pENTR11 2744 bp (rotated to position 2578) 

Location (Base Nos.)    Gene Encoded 
67..166                 attL1 
348..653                ccdB 
683..781                attL2 
904..1713               KmR 
1818..2391              ori 

1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC 
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT 
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTTCGA AGGAGATAGA 
181 ACCAATTCTC TAAGGAAATA CTTAACCATG GTCGACTGGA TCCGGTACCG AATTCGCTTA 
241 CTAAAAGCCA GATAACAGTA TGCGTATTTG CGCGCTGATT TTTGCGGTAT AAGAATATAT 
301 ACTGATATGT ATACCCGAAG TATGTCAAAA AGAGGTGTGC TTCTAGAATG CAGTTTAAGG 
361 TTTACACCTA TAAAAGAGAG AGCCGTTATC GTCTGTTTGT GGATGTACAG AGTGATATTA 
421 TTGACACGCC CGGGCGACGG ATAGTGATCC CCCTGGCCAG TGCACGTCTG CTGTCAGATA 
481 AAGTCTCCCG TGAACTTTAC CCGGTGGTGC ATATCGGGGA TGAAAGCTGG CGCATGATGA 
541 CCACCGATAT GGCCAGTGTG CCGGTCTCCG TTATCGGGGA AGAAGTGGCT GATCTCAGCC 
601 ACCGCGAAAA TGACATCAAA AACGCCATTA ACCTGATGTT CTGGGGAATA TAGAATTCGC 
661 GGCCGCACTC GAGATATCTA GACCCAGCTT TCTTGTACAA AGTTGGCATT ATAAGAAAGC 
721 ATTGCTTATC AATTTGTTGC AACGAACAGG TCACTATCAG TCAAAATAAA ATCATTATTT 
781 GCCATCCAGC TGCAGCTCTG GCCCGTGTCT CAAAATCTCT GATGTTACAT TGCACAAGAT 
841 AAAAATATAT CATCATGAAC AATAAAACTG TCTGCTTACA TAAACAGTAA TACAAGGGGT 
901 GTTATGAGCC ATATTCAACG GGAAACGTCG AGGCCGCGAT TAAATTCCAA CATGGATGCT 
961 GATTTATATG GGTATAAATG GGCTCGCGAT AATGTCGGGC AATCAGGTGC GACAATCTAT 
1021 CGCTTGTATG GGAAGCCCGA TGCGCCAGAG TTGTTTCTGA AACATGGCAA AGGTAGCGTT 
1081 GCCAATGATG TTACAGATGA GATGGTCAGA CTAAACTGGC TGACGGAATT TATGCCTCTT 
1141 CCGACCATCA AGCATTTTAT CCGTACTCCT GATGATGCAT GGTTACTCAC CACTGCGATC 
1201 CCCGGAAAAA CAGCATTCCA GGTATTAGAA GAATATCCTG ATTCAGGTGA AAATATTGTT 
1261 GATGCGCTGG CAGTGTTCCT GCGCCGGTTG CATTCGATTC CTGTTTGTAA TTGTCCTTTT 
1321 AACAGCGATC GCGTATTTCG TCTCGCTCAG GCGCAATCAC GAATGAATAA CGGTTTGGTT 
1381 GATGCGAGTG ATTTTGATGA CGAGCGTAAT GGCTGGCCTG TTGAACAAGT CTGGAAAGAA 
1441 ATGCATAAAC TTTTGCCATT CTCACCGGAT TCAGTCGTCA CTCATGGTGA TTTCTCACTT 
1501 GATAACCTTA TTTTTGACGA GGGGAAATTA ATAGGTTGTA TTGATGTTGG ACGAGTCGGA 
1561 ATCGCAGACC GATACCAGGA TCTTGCCATC CTATGGAACT GCCTCGGTGA GTTTTCTCCT 
1621 TCATTACAGA AACGGCTTTT TCAAAAATAT GGTATTGATA ATCCTGATAT GAATAAATTG 
1681 CAGTTTCATT TGATGCTCGA TGAGTTTTTC TAATCAGAAT TGGTTAATTG GTTGTAACAT 
1741 TATTCAGATT GGGCCCCGTT CCACTGAGGG TCAGACCCCG TAGAAAAGAT CAAAGGATCT 
1801 TCTTGAGATC CTTTTTTTCT GCGCGTAATC TGCTGCTTGC AAACAAAAAA ACCACCGCTA 
1861 CCAGCGGTGG TTTGTTTGCC GGATCAAGAG CTACCAACTC TTTTTCCGAA GGTAACTGGC 
1921 TTCAGCAGAG CGCAGATACC AAATACTGTT CTTCTAGTGT AGCCGTAGTT AGGCCACCAC 
1981 TTCAAGAACT CTGTAGCACC GCCTACATAC CTCGCTCTGC TAATCCTGTT ACCAGTGGCT 
2041 GCTGCCAGTG GCGATAAGTC GTGTCTTACC GGGTTGGACT CAAGACGATA GTTACCGGAT 
2101 AAGGCGCAGC GGTCGGGCTG AACGGGGGGT TCGTGCACAC AGCCCAGCTT GGAGCGAACG 
2161 ACCTACACCG AACTGAGATA CCTACAGCGT GAGCTATGAG AAAGCGCCAC GCTTCCCGAA 
2221 GGGAGAAAGG CGGACAGGTA TCCGGTAAGC GGCAGGGTCG GAACAGGAGA GCGCACGAGG 
2281 GAGCTTCCAG GGGGAAACGC CTGGTATCTT TATAGTCCTG TCGGGTTTCG CCACCTCTGA 
2341 CTTGAGCGTC GATTTTTGTG ATGCTCGTCA GGGGGGCGGA GCCTATGGAA AAACGCCAGC 
2401 AACGCGGCCT TTTTACGGTT CCTGGCCTTT TGCTGGCCTT TTGCTCACAT GTTCTTTCCT 
2461 GCGTTATCCC CTGATTCTGT GGATAACCGT ATTACCGCTA GCATGGATCT CGGGGACGTC 
2521 TAACTACTAA GCGAGAGTAG GGAACTGCCA GGCATCAAAT AAAACGAAAG GCTCAGTCGG 
2581 AAGACTGGGC CTTTCGTTTT ATCTGTTGTT TGTCGGTGAA CGCTCTCCTG AGTAGGACAA 
2641 ATCCGCCGGG AGCGGATTTG AACGTTGTGA AGCAACGGCC CGGAGGGTGG CGGGCAGGAC 
2701 GCCCGCCATA AACTGCCAGG CATCAAACTA AGCAGAAGGC CATC 

FIG.20B 
NATIVE PROTEIN EXPRESSION IN E. COLI 

                  Trc PROMOTER 
        -35                       -10          mRNA 
atgagctgtt gacaattaat catccggctc gtataatgtg tggaattgtg agcggataac 
tactcgacaa ctgttaatta gtaggccgag catattacac accttaacac tcgcctattg 

                                 Int attR1 
aatttcacac aggaaacaga caggtatagg atcacaagtt t|gtacaaa 
ttaaagtgtg tcctttgtct gtccatatcc tag|tgttcaa acatgttt| 

pDEST1 6464 bp 

Location (Base Nos.)    Gene Encoded 
216..257                Trc promoter 
397..273                attR1 
647..1306               CmR 
1426..1510              inactivated ccdA 
1648..1953              ccdB 
1994..2118              attR2 
2598..3503              ampR 
4104..4264              ori 
4504..4941              f1 ori (f1 intergenic region) 
5340..6420              lacIq 



1 GTTTGACAGC TTATCATCGA CTGCACGGTG CACCAATGCT TCTGGCGTCA GGCAGCCATC 
61 GGAAGCTGTG GTATGGCTGT GCAGGTCGTA AATCACTGCA TAATTCGTGT CGCTCAAGGC 
121 GCACTCCCGT TCTGGATAAT GTTTTTTGCG CCGACATCAT AACGGTTCTG GCAAATATTC 
181 TGAAATGAGC TGTTGACAAT TAATCATCCG GTCCGTATAA TCTGTGGAAT TGTGAGCGGG 
241 ATAACAATTT CATCGCGAGG TACCAAGCTA TCACAAGTTT GTACAAAAAA GCTGAACGAG 
301 AAACGTAAAA TGATATAAAT ATCAATATAT TAAATTAGAT TTTGCATAAA AAACAGACTA 
361 CATAATACTG TAAAACACAA CATATCCAGT CACTATGGCG GCCGCTAAGT TGGCAGCATC 
421 ACCCGACGCA CTTTGCGCCG AATAAATACC TGTGACGGAA GATCACTTCG CAGAATAAAT 
481 AAATCCTGGT GTCCCTGTTG ATACCGGGAA GCCCTGGGCC AACTTTTGGC GAAAATGAGA 
541 CGTTGATCGG CACGTAAGAG GTTCCAACTT TCACCATAAT GAAATAAGAT CACTACCGGG 
601 CGTATTTTTT GAGTTATCGA GATTTTCAGG AGCTAAGGAA GCTAAAATGG AGAAAAAAAT 
661 CACTGGATAT ACCACCGTTG ATATATCCCA ATGGCATCGT AAAGAACATT TTGAGGCATT 
721 TCAGTCAGTT GCTCAATGTA CCTATAACCA GACCGTTCAG CTGGATATTA CGGCCTTTTT 
781 AAAGACCGTA AAGAAAAATA AGCACAAGTT TTATCCGGCC TTTATTCACA TTCTTGCCCG 
841 CCTGATGAAT GCTCATCCGG AATTCCGTAT GGCAATGAAA GACGGTGAGC TGGTGATATG 
901 GGATAGTGTT CACCCTTGTT ACACCGTTTT CCATGAGCAA ACTGAAACGT TTTCATCGCT 
961 CTGGAGTGAA TACCACGACG ATTTCCGGCA GTTTCTACAC ATATATTCGC AAGATGTGGC 
1021 GTGTTACGGT GAAAACCTGG CCTATTTCCC TAAAGGGTTT ATTGAGAATA TGTTTTTCGT 
1081 CTCAGCCAAT CCCTGGGTGA GTTTCACCAG TTTTGATTTA AACGTGGCCA ATATGGACAA 
1141 CTTCTTCGCC CCCGTTTTCA CCATGGGCAA ATATTATACG CAAGGCGACA AGGTGCTGAT 
1201 GCCGCTGGCG ATTCAGGTTC ATCATGCCGT CTGTGATGGC TTCCATGTCG GCAGAATGCT 
1261 TAATGAATTA CAACAGTACT GCGATGAGTG GCAGGGCGGG GCGTAAACGC GTGGATCCGG 
1321 CTTACTAAAA GCCAGATAAC AGTATGCGTA TTTGCGCGCT GATTTTTGCG GTATAAGAAT 
1381 ATATACTGAT ATGTATACCC GAAGTATGTC AAAAAGAGGT GTGCTATGAA GCAGCGTATT 
1441 ACAGTGACAG TTGACAGCGA CAGCTATCAG TTGCTCAAGG CATATATGAT GTCAATATCT 
1501 CCGGTCTGGT AAGCACAACC ATGCAGAATG AAGCCCGTCG TCTGCGTGCC GAACGCTGGA 
1561 AAGCGGAAAA TCAGGAAGGG ATGGCTGAGG TCGCCCGGTT TATTGAAATG AACGGCTCTT 
1621 TTGCTGACGA GAACAGGGAC TGGTGAAATG CAGTTTAAGG TTTACACCTA TAAAAGAGAG 
1681 AGCCGTTATC GTCTGTTTGT GGATGTACAG AGTGATATTA TTGACACGCC CGGGCGACGG 
1741 ATGGTGATCC CCCTGGCCAG TGCACGTCTG CTGTCAGATA AAGTCTCCCG TGAACTTTAC 
1801 CCGGTGGTGC ATATCGGGGA TGAAAGCTGG CGCATGATGA CCACCGATAT GGCCAGTGTG 
1861 CCGGTCTCCG TTATCGGGGA AGAAGTGGCT GATCTCAGCC ACCGCGAAAA TGACATCAAA 
1921 AACGCCATTA ACCTGATGTT CTGGGGAATA TAAATGTCAG GCTCCCTTAT ACACAGCCAG 
1981 TCTGCAGGTC GACCATAGTG ACTGGATATG TTGTGTTTTA CAGTATTATG TAGTCTGTTT 
2041 TTTATGCAAA ATCTAATTTA ATATATTGAT ATTTATATCA TTTTACGTTT CTCGTTCAGC 
2101 TTTCTTGTAC AAAGTGGTGA TAGCTTGGCT GTTTTGGCGG ATGAGAGAAG ATTTTCAGCC 
2161 TGATACAGAT TAAATCAGAA CGCAGAAGCG GTCTGATAAA ACAGAATTTG CCTGGCGGCA 
2221 GTAGCGCGGT GGTCCCACCT GACCCCATGC CGAACTCAGA AGTGAAACGC CGTAGCGCCG 
2281 ATGGTAGTGT GGGGTCTCCC CATGCGAGAG TAGGGAACTG CCAGGCATCA AATAAAACGA 
2341 AAGGCTCAGT CGAAAGACTG GGCCTTTCGT TTTATCTGTT GTTTGTCGGT GAACGCTCTC 
2401 CTGAGTAGGA CAAATCCGCC GGGAGCGGAT TTGAACGTTG CGAAGCAACG GCCCGGAGGG 
2461 TGGCGGGCAG GACGCCCGCC ATAAACTGCC AGGCATCAAA TTAAGCAGAA GGCCATCCTG 
2521 ACGGATGGCC TTTTTGCGTT TCTACAAACT CTTTTTGTTT ATTTTTCTAA ATACATTCAA 
2581 ATATGTATCC GCTCATGAGA CAATAACCCT GATAAATGCT TCAATAATAT TGAAAAAGGA 
2641 AGAGTATGAG TATTCAACAT TTCCGTGTCG CCCTTATTCC CTTTTTTGCG GCATTTTGCC 
2701 TTCCTGTTTT TGCTCACCCA GAAACGCTGG TGAAAGTAAA AGATGCTGAA GATCAGTTGG 
2761 GTGCACGAGT GGGTTACATC GAACTGGATC TCAACAGCGG TAAGATCCTT GAGAGTTTTC 
2821 GCCCCGAAGA ACGTTTTCCA ATGATGAGCA CTTTTAAAGT TCTGCTATGT GGCGCGGTAT 
2881 TATCCCGTGT TGACGCCGGG CAAGAGCAAC TCGGTCGCCG CATACACTAT TCTCAGAATG 
2941 ACTTGGTTGA GTACTCACCA GTCACAGAAA AGCATCTTAC GGATGGCATG ACAGTAAGAG 
3001 AATTATGCAG TGCTGCCATA ACCATGAGTG ATAACACTGC GGCCAACTTA CTTCTGACAA 
3061 CGATCGGAGG ACCGAAGGAG CTAACCGCTT TTTTGCACAA CATGGGGGAT CATGTAACTC 
3121 GCCTTGATCG TTGGGAACCG GAGCTGAATG AAGCCATACC AAACGACGAG CGTGACACCA 
3181 CGATGCCTAC AGCAATGGCA ACAACGTTGC GCAAACTATT AACTGGCGAA CTACTTACTC 
3241 TAGCTTCCCG GCAACAATTA ATAGACTGGA TGGAGGCGGA TAAAGTTGCA GGACCACTTC 
3301 TGCGCTCGGC CCTTCCGGCT GGCTGGTTTA TTGCTGATAA ATCTGGAGCC GGTGAGCGTG 
3361 GGTCTCGCGG TATCATTGCA GCACTGGGGC CAGATGGTAA GCCCTCCCGT ATCGTAGTTA 
3421 TCTACACGAC GGGGAGTCAG GCAACTATGG ATGAACGAAA TAGACAGATC GCTGAGATAG 
3481 GTGCCTCACT GATTAAGCAT TGGTAACTGT CAGACCAAGT TTACTCATAT ATACTTTAGA 
3541 TTGATTTAAA ACTTCATTTT TAATTTAAAA GGATCTAGGT GAAGATCCTT TTTGATAATC 
3601 TCATGACCAA AATCCCTTAA CGTGAGTTTT CGTTCCACTG AGCGTCAGAC CCCGTAGAAA 
3661 AGATCAAAGG ATCTTCTTGA GATCCTTTTT TTCTGCGCGT AATCTGCTGC TTGCAAACAA 
3721 AAAAACCACC GCTACCAGCG GTGGTTTGTT TGCCGGATCA AGAGCTACCA ACTCTTTTTC 
3781 CGAAGGTAAC TGGCTTCAGC AGAGCGCAGA TACCAAATAC TGTCCTTCTA GTGTAGCCGT 
3841 AGTTAGGCCA CCACTTCAAG AACTCTGTAG CACCGCCTAC ATACCTCGCT CTGCTAATCC 
3901 TGTTACCAGT GGCTGCTGCC AGTGGCGATA AGTCGTGTCT TACCGGGTTG GACTCAAGAC 
3961 GATAGTTACC GGATAAGGCG CAGCGGTCGG GCTGAACGGG GGGTTCGTGC ACACAGCCCA 
4021 GCTTGGAGCG AACGACCTAC ACCGAACTGA GATACCTACA GCGTGAGCTA TGAGAAAGCG 
4081 CCACGCTTCC CGAAGGGAGA AAGGCGGACA GGTATCCGGT AAGCGGCAGG GTCGGAACAG 
4141 GAGAGCGCAC GAGGGAGCTT CCAGGGGGAA ACGCCTGGTA TCTTTATAGT CCTGTCGGGT 
4201 TTCGCCACCT CTGACTTGAG CGTCGATTTT TGTGATGCTC GTGAGGGGGG CGGAGCCTAT 
4261 GGAAAAACGC CAGCAACGCG GCCTTTTTAC GGTTCCTGGC CTTTTGCTGG CCTTTTGCTC 
4321 ACATGTTCTT TCCTGCGTTA TCCCCTGATT CTGTGGATAA CCGTATTACC GCCTTTGAGT 
4381 GAGCTGATAC CGCTCGCCGC AGCCGAACGA CCGAGCGCAG CGAGTCAGTG AGCGAGGAAG 
4441 CGGAAGAGCG CCTGATGCGG TATTTTCTCC TTACGCATCT GTGCGGTATT TCACACCGCA 
4501 TAATTTTGTT AAAATTCGCG TTAAATTTTT GTTAAATCAG CTCATTTTTT AACCAATAGG 
4561 CCGAAATCGG CAAAATCCCT TATAAATCAA AAGAATAGAC CGAGATAGGG TTGAGTGTTG 
4621 TTCCAGTTTG GAACAAGAGT CCACTATTAA AGAACGTGGA CTCCAACGTC AAAGGGCGAA 
4681 AAACCGTCTA TCAGGGCGAT GGCCCACTAC GTGAACCATC ACCCTAATCA AGTTTTTTGG 
4741 GGTCGAGGTG CCGTAAAGCA CTAAATCGGA ACCCTAAAGG GAGCCCCCGA TTTAGAGCTT 
4801 GACGGGGAAA GCCGGCGAAC GTGGCGAGAA AGGAAGGGAA GAAAGCGAAA GGAGCGGGCG 
4861 CTAGGGCGCT GGCAAGTGTA GCGGTCACGC TGCGCGTAAC CACCACACCC GCCGCGCTTA 
4921 ATGCGCCGCT ACAGGGCGCG TCCATTCGCC ATTCAGGCTG CTATGGTGCA CTCTCAGTAC 
4981 AATCTGCTCT GATGCCGCAT AGTTAAGCCA GTACCAGTCA CGTAGCGATA TCGGAGTGTA 
5041 TACACTCCGC TATCGCTACG TGACTGGGTC ATGGCTGCGC CCCGACACCC GCCAACACCC 
5101 GCTGACGCGC CCTGACGGGC TTGTCTGCTC CCGGCATCCG CTTACAGACA AGCTGTGACC 
5161 GTCTCCGGGA GCTGCATGTG TCAGAGGTTT TCACCGTCAT CACCGAAACG CGCGAGGCAG 
5221 CAGATCAATT CGCGCGCGAA GGCGAAGCGG CATGCATTTA CGTTGACACC ATCGAATGGT 
5281 GCAAAACCTT TCGCGGTATG GCATGATAGC GCCCGGAAGA GAGTCAATTC AGGGTGGTGA 
5341 ATGTGAAACC AGTAACGTTA TACGATGTCG CAGAGTATGC CGGTGTCTCT TATCAGACCG 
5401 TTTCCCGCGT GGTGAACCAG GCCAGCCACG TTTCTGCGAA AACGCGGGAA AAAGTGGAAG 
5461 CGGCGATGGC GGAGCTGAAT TACATTCCCA ACCGCGTGGC ACAACAACTG GCGGGCAAAC 
5521 AGTCGTTGCT GATTGGCGTT GCCACCTCCA GTCTGGCCCT GCACGCGCCG TCGCAAATTG 
5581 TCGCGGCGAT TAAATCTCGC GCCGATCAAC TGGGTGCCAG CGTGGTGGTG TCGATGGTAG 
5641 AACGAAGCGG CGTCGAAGCC TGTAAAGCGG CGGTGCACAA TCTTCTCGCG CAACGCGTCA 
5701 GTGGGCTGAT CATTAACTAT CCGCTGGATG ACCAGGATGC CATTGCTGTG GAAGCTGCCT 
5761 GCACTAATGT TCCGGCGTTA TTTCTTGATG TCTCTGACCA GACACCCATC AACAGTATTA 
5821 TTTTCTCCCA TGAAGACGGT ACGCGACTGG GCGTGGAGCA TCTGGTCGCA TTGGGTCACC 
5881 AGCAAATCGC GCTGTTAGCG GGCCCATTAA GTTCTGTCTC GGCGCGTCTG CGTCTGGCTG 
5941 GCTGGCATAA ATATCTCACT CGCAATCAAA TTCAGCCGAT AGCGGAACGG GAAGGCGACT 
6001 GGAGTGCCAT GTCCGGTTTT CAACAAACCA TGCAAATGCT GAATGAGGGC ATCGTTCCCA 
6061 CTGCGATGCT GGTTGCCAAC GATCAGATGG CGCTGGGCGC AATGCGCGCC ATTACCGAGT 
6121 CCGGGCTGCG CGTTGGTGCG GATATCTCGG TAGTGGGATA CGACGATACC GAAGACAGCT 
6181 CATGTTATAT CCCGCCGTTA ACCACCATCA AACAGGATTT TCGCCTGCTG GGGCAAACCA 
6241 GCGTGGACCG CTTGCTGCAA CTCTCTCAGG GCCAGGCGGT GAAGGGCAAT CAGCTGTTGC 
6301 CCGTCTCACT GGTGAAAAGA AAAACCACCC TGGCACCCAA TACGCAAACC GCCTCTCCCC 
6361 GCGCGTTGGC CGATTCATTA ATGCAGCTGG CACGACAGGT TTCCCGACTG GAAAGCGGGC 
6421 AGTGAGCGCA ACGCAATTAA TGTGAGTTAG CGCGAATTGA TCTG 
His6 FUSIONS IN E. COLI 

                              -35   Trc PROMOTER 
 970 aat att ctg aaa tga gct gtt gac aat taa tca tcc ggt ccg 
     tta taa gac ttt act cga caa ctg tta att agt agg cca ggc 

         mRNA                                                Met Ser Tyr 
1021 tgg|aat tgt gag cgg ata aca att tca cac agg aaa cag acc atg tcg tac 
     acc tta aca ctc gcc tat tgt taa agt gtg tcc ttt gtc tgg tac agc atg 

     Tyr His His His His His His Gly Ile Thr Ser  Int attR1 
1072 tac cat cac cat cac cat cac ggc atc|aca agt ttg 

pDEST2 6553 bp 

Location (Base Nos.)    Gene Encoded 
912..962                Trc 
1223..1009              attR1 
1473..2132              CmR 
2252..2336              inactivated ccdA 
2474..2779              ccdB 
2820..2944              attR2 
3509..4414              ampR 
5015..5175              ori 
5415..5852              f1 ori (f1 intergenic region) 
6225..752               lacIq 



1 GGCGGTGCAC AATCTTCTCG CGCAACGCGT CAGTGGGCTG ATCATTAACT ATCCGCTGGA 
61 TGACCAGGAT GCCATTGCTG TGGAAGCTGC CTGCACTAAT GTTCCGGCGT TATTTCTTGA 
121 TGTCTCTGAC CAGACACCCA TCAACAGTAT TATTTTCTCC CATGAAGACG GTACGCGACT 
181 GGGCGTGGAG CATCTGGTCG CATTGGGTCA CCAGCAAATC GCGCTGTTAG CGGGCCCATT 
241 AAGTTCTGTC TCGGCGCGTC TGCGTCTGGC TGGCTGGCAT AAATATCTCA CTCGCAATCA 
301 AATTCAGCCG ATAGCGGAAC GGGAAGGCGA CTGGAGTGCC ATGTCCGGTT TTCAACAAAC 
361 CATGCAAATG CTGAATGAGG GCATCGTTCC CACTGCGATG CTGGTTGCCA ACGATCAGAT 
421 GGCGCTGGGC GCAATGCGCG CCATTACCGA GTCCGGGCTG CGCGTTGGTG CGGATATCTC 
481 GGTAGTGGGA TACGACGATA CCGAAGACAG CTCATGTTAT ATCCCGCCGT CAACCACCAT 
541 CAAACAGGAT TTTCGCCTGC TGGGGCAAAC CAGCGTGGAC CGCTTGCTGC AACTCTCTCA 
601 GGGCCAGGCG GTGAAGGGCA ATCAGCTGTT GCCCGTCTCA CTGGTGAAAA GAAAAACCAC 
661 CCTGGCACCC AATACGCAAA CCGCCTCTCC CCGCGCGTTG GCCGATTCAT TAATGCAGCT 
721 GGCACGACAG GTTTCCCGAC TGGAAAGCGG GCAGTGAGCG CAACGCAATT AATGTGAGTT 
781 AGCGCGAATT GATCTGGTTT GACAGCTTAT CATCGACTGC ACGGTGCACC AATGCTTCTG 
841 GCGTCAGGCA GCCATCGGAA GCTGTGGTAT GGCTGTGCAG GTCGTAAATC ACTGCATAAT 
901 TCGTGTCGCT CAAGGCGCAC TCCCGTTCTG GATAATGTTT TTTGCGCCGA CATCATAACG 
961 GTTCTGGCAA ATATTCTGAA ATGAGCTGTT GACAATTAAT CATCCGGTCC GTATAATCTG 
1021 TGGAATTGTG AGCGGATAAC AATTTCACAC AGGAAACAGA CCATGTCGTA CTACCATCAC 
1081 CATCACCATC ACGGCATCAC AAGTTTGTAC AAAAAAGCTG AACGAGAAAC GTAAAATGAT 
1141 ATAAATATCA ATATATTAAA TTAGATTTTG CATAAAAAAC AGACTACATA ATACTGTAAA 
1201 ACACAACATA TCCAGTCACT ATGGCGGCCG CTAAGTTGGC AGCATCACCC GACGCACTTT 
1261 GCGCCGAATA AATACCTGTG ACGGAAGATC ACTTCGCAGA ATAAATAAAT CCTGGTGTCC 
1321 CTGTTGATAC CGGGAAGCCC TGGGCCAACT TTTGGCGAAA ATGAGACGTT GATCGGCACG 
1381 TAAGAGGTTC CAACTTTCAC CATAATGAAA TAAGATCACT ACCGGGCGTA TTTTTTGAGT 
1441 TATCGAGATT TTCAGGAGCT AAGGAAGCTA AAATGGAGAA AAAAATCACT GGATATACCA 
1501 CCGTTGATAT ATCCCAATGG CATCGTAAAG AACATTTTGA GGCATTTCAG TCAGTTGCTC 
1561 AATGTACCTA TAACCAGACC GTTCAGCTGG ATATTACGGC CTTTTTAAAG ACCGTAAAGA 
1621 AAAATAAGCA CAAGTTTTAT CCGGCCTTTA TTCACATTCT TGCCCGCCTG ATGAATGCTC 
1681 ATCCGGAATT CCGTATGGCA ATGAAAGACG GTGAGCTGGT GATATGGGAT AGTGTTCACC 
1741 CTTGTTACAC CGTTTTCCAT GAGCAAACTG AAACGTTTTC ATCGCTCTGG AGTGAATACC 
1801 ACGACGATTT CCGGCAGTTT CTACACATAT ATTCGCAAGA TGTGGCGTGT TACGGTGAAA 
1861 ACCTGGCCTA TTTCCCTAAA GGGTTTATTG AGAATATGTT TTTCGTCTCA GCCAATCCCT 
1921 GGGTGAGTTT CACCAGTTTT GATTTAAACG TGGCCAATAT GGACAACTTC TTCGCCCCCG 
1981 TTTTCACCAT GGGCAAATAT TATACGCAAG GCGACAAGGT GCTGATGCCG CTGGCGATTC 
2041 AGGTTCATCA TGCCGTCTGT GATGGCTTCC ATGTCGGCAG AATGCTTAAT GAATTACAAC 
2101 AGTACTGCGA TGAGTGGCAG GGCGGGGCGT AAACGCGTGG ATCCGGCTTA CTAAAAGCCA 
2161 GATAACAGTA TGCGTATTTG CGCGCTGATT TTTGCGGTAT AAGAATATAT ACTGATATGT 
2221 ATACCCGAAG TATGTCAAAA AGAGGTGTGC TATGAAGCAG CGTATTACAG TGACAGTTGA 
2281 CAGCGACAGC TATCAGTTGC TCAAGGCATA TATGATGTCA ATATCTCCGG TCTGGTAAGC 
2341 ACAACCATGC AGAATGAAGC CCGTCGTCTG CGTGCCGAAC GCTGGAAAGC GGAAAATCAG 
2401 GAAGGGATGG CTGAGGTCGC CCGGTTTATT GAAATGAACG GCTCTTTTGC TGACGAGAAC 
2461 AGGGACTGGT GAAATGCAGT TTAAGGTTTA CACCTATAAA AGAGAGAGCC GTTATCGTCT 
2521 GTTTGTGGAT GTACAGAGTG ATATTATTGA CACGCCCGGG CGACGGATGG TGATCCCCCT 
2581 GGCCAGTGCA CGTCTGCTGT CAGATAAAGT CTCCCGTGAA CTTTACCCGG TGGTGCATAT 
2641 CGGGGATGAA AGCTGGCGCA TGATGACCAC CGATATGGCC AGTGTGCCGG TCTCCGTTAT 
2701 CGGGGAAGAA GTGGCTGATC TCAGCCACCG CGAAAATGAC ATCAAAAACG CCATTAACCT 
2761 GATGTTCTGG GGAATATAAA TGTCAGGCTC CCTTATACAC AGCCAGTCTG CAGGTCGACC 
2821 ATAGTGACTG GATATGTTGT GTTTTACAGT ATTATGTAGT CTGTTTTTTA TGCAAAATCT 
2881 AATTTAATAT ATTGATATTT ATATCATTTT ACGTTTCTCG TTCAGCTTTC TTGTACAAAG 
2941 TGGTGATGCC CATATGGGAA TTCAAAGGCC TACGTCGACG AGCTCACTAG TCGCGGCCGC 
3001 TTCTAGAGGA TCCCTCGAGG CATGCGGTAC CAAGCTTGGC TGTTTTGGCG GATGAGAGAA 
3061 GATTTTCAGC CTGATACAGA TTAAATCAGA ACGCAGAAGC GGTCTGATAA AACAGAATTT 
3121 GCCTGGCGGC AGTAGCGCGG TGGTCCCACC TGACCCCATG CCGAACTCAG AAGTGAAACG 
3181 CCGTAGCGCC GATGGTAGTG TGGGGTCTCC CCATGCGAGA GTAGGGAACT GCCAGGCATC 
3241 AAATAAAACG AAAGGCTCAG TCGAAAGACT GGGCCTTTCG TTTTATCTGT TGTTTGTCGG 
3301 TGAACGCTCT CCTGAGTAGG ACAAATCCGC CGGGAGCGGA TTTGAACGTT GCGAAGCAAC 
3361 GGCCCGGAGG GTGGCGGGCA GGACGCCCGC CATAAACTGC CAGGCATCAA ATTAAGCAGA 
3421 AGGCCATCCT GACGGATGGC CTTTTTGCGT TTCTACAAAC TCTTTTTGTT TATTTTTCTA 
3481 AATACATTCA AATATGTATC CGCTCATGAG ACAATAACCC TGATAAATGC TTCAATAATA 
3541 TTGAAAAAGG AAGAGTATGA GTATTCAACA TTTCCGTGTC GCCCTTATTC CCTTTTTTGC 
3601 GGCATTTTGC CTTCCTGTTT TTGCTCACGC AGAAACGCTG GTGAAAGTAA AAGATGCTGA 
3661 AGATCAGTTG GGTGCACGAG TGGGTTACAT CGAACTGGAT CTCAACAGCG GTAAGATCCT 
3721 TGAGAGTTTT CGCCCCGAAG AACGTTTTCC AATGATGAGC ACTTTTAAAG TTCTGCTATG 
3781 TGGCGCGGTA TTATCCCGTG TTGACGCCGG GCAAGAGCAA CTCGGTCGCC GCATACACTA 
3841 TTCTCAGAAT GACTTGGTTG AGTACTCACC AGTCACAGAA AAGCATCTTA CGGATGGCAT 
3901 GACAGTAAGA GAATTATGCA GTGCTGCCAT AACCATGAGT GATAACACTG CGGCCAACTT 
3961 ACTTCTGACA ACGATCGGAG GACCGAAGGA GCTAACCGCT TTTTTGCACA ACATGGGGGA 
4021 TCATGTAACT CGCCTTGATC GTTGGGAACC GGAGCTGAAT GAAGCCATAC CAAACGACGA 
4081 GCGTGACACC ACGATGCCTA CAGCAATGGC AACAACGTTG CGCAAACTAT TAACTGGCGA 
4141 ACTACTTACT CTAGCTTCCC GGCAACAATT AATAGACTGG ATGGAGGCGG ATAAAGTTGC 
4201 AGGACCACTT CTGCGCTCGG CCCTTCCGGC TGGCTGGTTT ATTGCTGATA AATCTGGAGC 
4261 CGGTGAGCGT GGGTCTCGCG GTATCATTGC AGCACTGGGG CCAGATGGTA AGCCCTCCCG 
4321 TATCGTAGTT ATCTACACGA CGGGGAGTCA GGCAACTATG GATGAACGAA ATAGACAGAT 
4381 CGCTGAGATA GGTGCCTCAC TGATTAAGCA TTGGTAACTG TCAGACCAAG TTTACTCATA 
4441 TATACTTTAG ATTGATTTAA AACTTCATTT TTAATTTAAA AGGATCTAGG TGAAGATCCT 
4501 TTTTGATAAT CTCATGACCA AAATCCCTTA ACGTGAGTTT TCGTTCCACT GAGCGTCAGA 
4561 CCCCGTAGAA AAGATCAAAG GATCTTCTTG AGATCCTTTT TTTCTGCGCG TAATCTGCTG 
4621 CTTGCAAACA AAAAAACCAC CGCTACCAGC GGTGGTTTGT TTGCCGGATC AAGAGCTACC 
4681 AACTCTTTTT CCGAAGGTAA CTGGCTTCAG CAGAGCGCAG ATACCAAATA CTGTCCTTCT 
4741 AGTGTAGCCG TAGTTAGGCC ACCACTTCAA GAACTCTGTA GCACCGCCTA CATACCTCGC 
4801 TCTGCTAATC CTGTTACCAG TGGCTGCTGC CAGTGGCGAT AAGTCGTGTC TTACCGGGTT 
4861 GGACTCAAGA CGATAGTTAC CGGATAAGGC GCAGCGGTCG GGCTGAACGG GGGGTTCGTG 
4921 CACACAGCCC AGCTTGGAGC GAACGACCTA CACCGAACTG AGATACCTAC AGCGTGAGCT 
4981 ATGAGAAAGC GCCACGCTTC CCGAAGGGAG AAAGGCGGAC AGGTATCCGG TAAGCGGCAG 
5041 GGTCGGAACA GGAGAGCGCA CGAGGGAGCT TCCAGGGGGA AACGCCTGGT ATCTTTATAG 
5101 TCCTGTCGGG TTTCGCCACC TCTGACTTGA GCGTCGATTT TTGTGATGCT CGTCAGGGGG 
5161 GCGGAGCCTA TGGAAAAACG CCAGCAACGC GGCCTTTTTA CGGTTCCTGG CCTTTTGCTG 
5221 GCCTTTTGCT CACATGTTCT TTCCTGCGTT ATCCCCTGAT TCTGTGGATA ACCGTATTAC 
5281 CGCCTTTGAG TGAGCTGATA CCGCTCGCCG CAGCCGAACG ACCGAGCGCA GCGAGTCAGT 
5341 GAGCGAGGAA GCGGAAGAGC GCCTGATGCG GTATTTTCTC CTTACGCATC TGTGCGGTAT 
5401 TTCACACCGC ATAATTTTGT TAAAATTCGC GTTAAATTTT TGTTAAATCA GCTCATTTTT 
5461 TAACCAATAG GCCGAAATCG GCAAAATCCC TTATAAATCA AAAGAATAGA CCGAGATAGG 
5521 GTTGAGTGTT GTTCCAGTTT GGAACAAGAG TCCACTATTA AAGAACGTGG ACTCCAACGT 
5581 CAAAGGGCGA AAAACCGTCT ATCAGGGCGA TGGCCCACTA CGTGAACCAT CACCCTAATC 
5641 AAGTTTTTTG GGGTCGAGGT GCCGTAAAGC ACTAAATCGG AACCCTAAAG GGAGCCCCCG 
5701 ATTTAGAGCT TGACGGGGAA AGCCGGCGAA CGTGGCGAGA AAGGAAGGGA AGAAAGCGAA 
5761 AGGAGCGGGC GCTAGGGCGC TGGCAAGTGT AGCGGTCACG CTGCGCGTAA CCACCACACC 
5821 CGCCGCGCTT AATGCGCCGC TACAGGGCGC GTCCCATTCG CCATTCAGGC TGCTATGGTG 
5881 CACTCTCAGT ACAATCTGCT CTGATGCCGC ATAGTTAAGC CAGTATACAC TCCGCTATCG 
5941 CTACGTGACT GGGTCATGGC TGCGCCCCGA CACCCGCCAA CACCCGCTGA CGCGCCCTGA 
6001 CGGGCTTGTC TGCTCCCGGC ATCCGCTTAC AGACAAGCTG TGACCGTCTC CGGGAGCTGC 
6061 ATGTGTCAGA GGTTTTCACC GTCATCACCG AAACGCGCGA GGCAGCAGAT CAATTCGCGC 
6121 GCGAAGGCGA AGCGGCATGC ATTTACGTTG ACACCATCGA ATGGTGCAAA ACCTTTCGCG 
6181 GTATGGCATG ATAGCGCCCG GAAGAGAGTC AATTCAGGGT GGTGAATGTG AAACCAGTAA 
6241 CGTTATACGA TGTCGCAGAG TATGCCGGTG TCTCTTATCA GACCGTTTCC CGCGTGGTGA 
6301 ACCAGGCCAG CCACGTTTCT GCGAAAACGC GGGAAAAAGT GGAAGCGGCG ATGGCGGAGC 
6361 TGAATTACAT TCCCAACCGC GTGGCACAAC AACTGGCGGG CAAACAGTCG TTGCTGATTG 
6421 GCGTTGCCAC CTCCAGTCTG GCCCTGCACG CGCCGTCGCA AATTGTCGCG GCGATTAAAT 
6481 CTCGCGCCGA TCAACTGGGT GCCAGCGTGG TGGTGTCGAT GGTAGAACGA AGCGGCGTCG 
6541 AAGCCTGTAA AGC 
GST FUSIONS IN E. COLI 

Trc PROMOTER (-35, -10); mRNA 

154 cgg ttc tgg caa ata ttc tga aat gag ctg ttg aca att aat cat cgg ctc 
    gcc aag acc gtt tat aag act tta ctc gac aac tgt taa tta gta gcc gag 

205 gta taa tgt gtg gaa ttg tga gcg gat aac aat ttc aca cag gaa aca gta 
    cat att aca cac ctt aac act cgc cta ttg tta aag tgt gtc ctt tgt cat 

         M   S   P   I   L  ... GST 
256 ttc atg tcc cct ata cta ggt tat tgg aaa att aag ggc ctt gtg caa ccc 
    aag tac agg gga tat gat cca ata acc ttt taa ttc ccg gaa cac gtt ggg 
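
The two-strand layout of the figure can be checked mechanically. The sketch below is illustrative only and not part of the original figure: it takes the row starting at base 256 (with OCR errors corrected), confirms that the lower strand is the base-by-base complement of the upper strand, and translates the five codons following `ttc` to confirm the M S P I L annotation. The helper names `complement` and `translate` are hypothetical, and the codon table holds only the five codons needed here.

```python
# Complement table for lowercase DNA; spaces between codons pass through unchanged.
COMPLEMENT = str.maketrans("acgt", "tgca")

# Only the five codons needed for this check (standard genetic code).
CODONS = {"atg": "M", "tcc": "S", "cct": "P", "ata": "I", "cta": "L"}


def complement(strand: str) -> str:
    """Base-by-base complement (not reversed), as the figure prints the lower strand."""
    return strand.translate(COMPLEMENT)


def translate(codons) -> str:
    """Translate a list of codons using the small table above."""
    return "".join(CODONS[c] for c in codons)


# Row 256 of the figure, upper and lower strands, OCR errors corrected.
top = "ttc atg tcc cct ata cta ggt tat tgg aaa att aag ggc ctt gtg caa ccc"
bottom = "aag tac agg gga tat gat cca ata acc ttt taa ttc ccg gaa cac gtt ggg"

strands_agree = complement(top) == bottom
peptide = translate(top.split()[1:6])  # codons after the leading 'ttc'
print(strands_agree, peptide)
```

The same `complement` check can be applied to any other row of the figure to spot residual OCR errors in either strand.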
pDEST3 6823 bp 

Location (Base Nos.)    Gene Encoded 

150..200                Trc 
1087..963               attR1 
1337..1996              CmR 
2116..2200              inactivated ccdA 
2338..2643              ccdB 
2684..2808              attR2 
3231..4091              ampR 
5295..6254              lacIq 
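
A location such as `1087..963` lists the larger base number first. The sketch below is a reading aid under a stated assumption, not something the table itself says: it assumes that `start > end` marks a feature on the complementary strand, and it computes each feature's length from its endpoints. The function names are hypothetical; the coordinates are copied from the pDEST3 table above.

```python
import re


def parse_location(text: str) -> tuple[int, int]:
    """Parse 'start..end' into integers, tolerating stray spaces such as '1337 . . 1996'."""
    match = re.fullmatch(r"\s*(\d+)\s*\.\s*\.\s*(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return int(match.group(1)), int(match.group(2))


def describe(text: str) -> tuple[str, int]:
    """Return (strand, length). ASSUMPTION: start > end means the complementary strand."""
    start, end = parse_location(text)
    strand = "forward" if start <= end else "reverse"
    return strand, abs(end - start) + 1


# Coordinates from the pDEST3 feature table.
features = {
    "attR1": "1087..963",
    "CmR": "1337 . . 1996",
    "ccdB": "2338..2643",
    "attR2": "2684..2808",
}

for name, location in features.items():
    print(name, describe(location))
```

Under that assumption attR1 and attR2 both span 125 bases, which is consistent with the two sites being the same length in opposite orientations.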

1 ACGTTATCGA CTGCACGGTG CACCAATGCT TCTGGCGTCA GGCAGCCATC GGAAGCTGTG 
61 GTATGGCTGT GCAGGTCGTA AATCACTGCA TAATTCGTGT CGCTCAAGGC GCACTCCCGT 
121 TCTGGATAAT GTTTTTTGCG CCGACATCAT AACGGTTCTG GCAAATATTC TGAAATGAGC 
181 TGTTGACAAT TAATCATCGG CTCGTATAAT GTGTGGAATT GTGAGCGGAT AACAATTTCA 
241 CACAGGAAAC AGTATTCATG TCCCCTATAC TAGGTTATTG GAAAATTAAG GGCCTTGTGC 
301 AACCCACTCG ACTTCTTTTG GAATATCTTG AAGAAAAATA TGAAGAGCAT TTGTATGAGC 
361 GCGATGAAGG TGATAAATGG CGAAACAAAA AGTTTGAATT GGGTTTGGAG TTTCCCAATC 
421 TTCCTTATTA TATTGATGGT GATGTTAAAT TAACACAGTC TATGGCCATC ATACGTTATA 
481 TAGCTGACAA GCACAACATG TTGGGTGGTT GTCCAAAAGA GCGTGCAGAG ATTTCAATGC 
541 TTGAAGGAGC GGTTTTGGAT ATTAGATACG GTGTTTCGAG AATTGCATAT AGTAAAGACT 
601 TTGAAACTCT CAAAGTTGAT TTTCTTAGCA AGCTACCTGA AATGCTGAAA ATGTTCGAAG 
661 ATCGTTTATG TCATAAAACA TATTTAAATG GTGATCATGT AACCCATCCT GACTTCATGT 
721 TGTATGACGC TCTTGATGTT GTTTTATAGA TGGACCCAAT GTGCCTGGAT GCGTTCCCAA 
781 AATTAGTTTG TTTTAAAAAA CGTATTGAAG CTATCCCACA AATTGATAAG TACTTGAAAT 
841 CCAGCAAGTA TATAGCATGG CCTTTGCAGG GCTGGCAAGC CACGTTTGGT GGTGGCGACC 
901 ATCCTCCAAA ATCGGATCTG GTTCCGCGTG GATCTCGTCG TGCATCTGTT GGATCCCCAT 
961 CAACAAGTTT GTACAAAAAA GCTGAACGAG AAACGTAAAA TGATATAAAT ATCAATATAT 
1021 TAAATTAGAT TTTGCATAAA AAACAGACTA CATAATACTG TAAAACACAA CATATCCAGT 
1081 CACTATGGCG GCCGCTAAGT TGGCAGCATC ACCCGACGCA CTTTGCGCCG AATAAATACC 
1141 TGTGACGGAA GATCACTTCG CAGAATAAAT AAATCCTGGT GTCCCTGTTG ATACCGGGAA 
1201 GCCCTGGGCC AACTTTTGGC GAAAATGAGA CGTTGATCGG CACGTAAGAG GTTCCAACTT 
1261 TCACCATAAT GAAATAAGAT CACTACCGGG CGTATTTTTT GAGTTATCGA GATTTTCAGG 
1321 AGCTAAGGAA GCTAAAATGG AGAAAAAAAT CACTGGATAT ACCACCGTTG ATATATCCCA 
1381 ATGGCATCGT AAAGAACATT TTGAGGCATT TCAGTCAGTT GCTCAATGTA CCTATAACCA 
1441 GACCGTTCAG CTGGATATTA CGGCCTTTTT AAAGACCGTA AAGAAAAATA AGCACAAGTT 
1501 TTATCCGGCC TTTATTCACA TTCTTGCCCG CCTGATGAAT GCTCATCCGG AATTCCGTAT 
1561 GGCAATGAAA GACGGTGAGC TGGTGATATG GGATAGTGTT CACCCTTGTT ACACCGTTTT 
1621 CCATGAGCAA ACTGAAACGT TTTCATCGCT CTGGAGTGAA TACCACGACG ATTTCCGGCA 
1681 GTTTCTACAC ATATATTCGC AAGATGTGGC GTGTTACGGT GAAAACCTGG CCTATTTCCC 
1741 TAAAGGGTTT ATTGAGAATA TGTTTTTCGT CTCAGCCAAT CCCTGGGTGA GTTTCACCAG 
1801 TTTTGATTTA AACGTGGCCA ATATGGACAA CTTCTTCGCC CCCGTTTTCA CCATGGGCAA 
1861 ATATTATACG CAAGGCGACA AGGTGCTGAT GCCGCTGGCG ATTCAGGTTC ATCATGCCGT 
1921 CTGTGATGGC TTCCATGTCG GCAGAATGCT TAATGAATTA CAACAGTACT GCGATGAGTG 
1981 GCAGGGCGGG GCGTAAAGAT CTGGATCCGG CTTACTAAAA GCCAGATAAC AGTATGCGTA 
2041 TTTGCGCGCT GATTTTTGCG GTATAAGAAT ATATACTGAT ATGTATACCC GAAGTATGTC 
2101 AAAAAGAGGT GTGCTATGAA GCAGCGTATT ACAGTGACAG TTGACAGCGA CAGCTATCAG 
2161 TTGCTCAAGG CATATATGAT GTCAATATCT CCGGTCTGGT AAGCACAACC ATGCAGAATG 
2221 AAGCCCGTCG TCTGCGTGCC GAACGCTGGA AAGCGGAAAA TCAGGAAGGG ATGGCTGAGG 
2281 TCGCCCGGTT TATTGAAATG AACGGCTCTT TTGCTGACGA GAACAGGGAC TGGTGAAATG 
2341 CAGTTTAAGG TTTACACCTA TAAAAGAGAG AGCCGTTATC GTCTGTTTGT GGATGTACAG 
2401 AGTGATATTA TTGACACGCC CGGGCGACGG ATGGTGATCC CCCTGGCCAG TGCACGTCTG 
2461 CTGTCAGATA AAGTCTCCCG TGAACTTTAC CCGGTGGTGC ATATCGGGGA TGAAAGCTGG 
2521 CGCATGATGA CCACCGATAT GGCCAGTGTG CCGGTCTCCG TTATCGGGGA AGAAGTGGCT 
2581 GATCTCAGCC ACCGCGAAAA TGACATCAAA AACGCCATTA ACCTGATGTT CTGGGGAATA 
2641 TAAATGTCAG GCTCCCTTAT ACACAGCCAG TCTGCAGGTC GACCATAGTG ACTGGATATG 
2701 TTGTGTTTTA CAGTATTATG TAGTCTGTTT TTTATGCAAA ATCTAATTTA ATATATTGAT 
2761 ATTTATATCA TTTTACGTTT CTCGTTCAGC TTTCTTGTAC AAAGTGGTTG ATGGGAATTC 
2821 ATCGTGACTG ACTGACGATC TGCCTCGCGC GTTTCGGTGA TGACGGTGAA AACCTCTGAC 
2881 ACATGCAGCT CCCGGAGACG GTCACAGCTT GTCTGTAAGC GGATGCCGGG AGCAGACAAG 
2941 CCCGTCAGGG CGCGTCAGCG GGTGTTGGCG GGTGTCGGGG CGCAGCCATG ACCCAGTCAC 
3001 GTAGCGATAG CGGAGTGTAT AATTCTTGAA GACGAAAGGG CCTCGTGATA CGCCTATTTT 
3061 TATAGGTTAA TGTCATGATA ATAATGGTTT CTTAGACGTC AGGTGGCACT TTTCGGGGAA 
3121 ATGTGCGCGG AACCCCTATT TGTTTATTTT TCTAAATACA TTCAAATATG TATCCGCTCA 
3181 TGAGACAATA ACCCTGATAA ATGCTTCAAT AATATTGAAA AAGGAAGAGT ATGAGTATTC 
3241 AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT TTGCCTTCCT GTTTTTGCTC 
3301 ACCCAGAAAC GCTGGTGAAA GTAAAAGATG CTGAAGATCA GTTGGGTGCA CGAGTGGGTT 
3361 ACATCGAACT GGATCTCAAC AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT 
3421 TTCCAATGAT GAGCACTTTT AAAGTTCTGC TATGTGGCGC GGTATTATCC CGTGTTGACG 
3481 CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG GTTGAGTACT 
3541 CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT AAGAGAATTA TGCAGTGCTG 
3601 CCATAACCAT GAGTGATAAC ACTGCGGCCA ACTTACTTCT GACAACGATC GGAGGACCGA 
3661 AGGAGCTAAC CGCTTTTTTG CACAACATGG GGGATCATGT AACTCGCCTT GATCGTTGGG 
3721 AACCGGAGCT GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 
3781 TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT TCCCGGCAAC 
3841 AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC ACTTCTGCGC TCGGCCCTTC 
3901 CGGCTGGCTG GTTTATTGCT GATAAATCTG GAGCCGGTGA GCGTGGGTCT CGCGGTATCA 
3961 TTGCAGCACT GGGGCCAGAT GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA 
4021 GTCAGGCAAC TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 
4081 AGCATTGGTA ACTGTCAGAC CAAGTTTACT CATATATACT TTAGATTGAT TTAAAACTTC 
4141 ATTTTTAATT TAAAAGGATC TAGGTGAAGA TCCTTTTTGA TAATCTCATG ACCAAAATCC 
4201 CTTAACGTGA GTTTTCGTTC CACTGAGCGT CAGACCCCGT AGAAAAGATC AAAGGATCTT 
4261 CTTGAGATCC TTTTTTTCTG CGCGTAATCT GCTGCTTGCA AACAAAAAAA CCACCGCTAC 
4321 CAGCGGTGGT TTGTTTGCCG GATCAAGAGC TACCAACTCT TTTTCCGAAG GTAACTGGCT 
4381 TCAGCAGAGC GCAGATACCA AATACTGTCC TTCTAGTGTA GCCGTAGTTA GGCCACCACT 
4441 TCAAGAACTC TGTAGCACCG CCTACATACC TCGCTCTGCT AATCCTGTTA CCAGTGGCTG 
4501 CTGCCAGTGG CGATAAGTCG TGTCTTACCG GGTTGGACTC AAGACGATAG TTACCGGATA 
4561 AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT CGTGCACACA GCCCAGCTTG GAGCGAACGA 
4621 CCTACACCGA ACTGAGATAC CTACAGCGTG AGCTATGAGA AAGCGCCACG CTTCCCGAAG 
4681 GGAGAAAGGC GGACAGGTAT CCGGTAAGCG GCAGGGTCGG AACAGGAGAG CGCACGAGGG 
4741 AGCTTCCAGG GGGAAACGCC TGGTATCTTT ATAGTCCTGT CGGGTTTCGC CACCTCTGAC 
4801 TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG CCTATGGAAA AACGCCAGCA 
4861 ACGCGGCCTT TTTACGGTTC CTGGCCTTTT GCTGGCCTTT TGCTCACATG TTCTTTCCTG 
4921 CGTTATCCCC TGATTCTGTG GATAACCGTA TTACCGCCTT TGAGTGAGCT GATACCGCTC 
4981 GCCGCAGCCG AACGACCGAG CGCAGCGAGT CAGTGAGCGA GGAAGCGGAA GAGCGCCTGA 
5041 TGCGGTATTT TCTCCTTACG CATCTGTGCG GTATTTCACA CCGCATAAAT TCCGACACCA 
5101 TCGAATGGTG CAAAACCTTT CGCGGTATGG CATGATAGCG CCCGGAAGAG AGTCAATTCA 
5161 GGGTGGTGAA TGTGAAACCA GTAACGTTAT ACGATGTCGC AGAGTATGCC GGTGTCTCTT 
5221 ATCAGACCGT TTCCCGCGTG GTGAACCAGG CCAGCCACGT TTCTGCGAAA ACGCGGGAAA 
5281 AAGTGGAAGC GGCGATGGCG GAGCTGAATT ACATTCCCAA CCGCGTGGCA CAACAACTGG 
5341 CGGGCAAACA GTCGTTGCTG ATTGGCGTTG CCACCTCCAG TCTGGCCCTG CACGCGCCGT 
5401 CGCAAATTGT CGCGGCGATT AAATCTCGCG CCGATCAACT GGGTGCCAGC GTGGTGGTGT 
5461 CGATGGTAGA ACGAAGCGGC GTCGAAGCCT GTAAAGCGGC GGTGCACAAT CTTCTCGCGC 
5521 AACGCGTCAG TGGGCTGATC ATTAACTATC CGCTGGATGA CCAGGATGCC ATTGCTGTGG 
5581 AAGCTGCCTG CACTAATGTT CCGGCGTTAT TTCTTGATGT CTCTGACCAG ACACCCATCA 
5641 ACAGTATTAT TTTCTCCCAT GAAGACGGTA CGCGACTGGG CGTGGAGCAT CTGGTCGCAT 
5701 TGGGTCACCA GCAAATCGCG CTGTTAGCGG GCCCATTAAG TTCTGTCTCG GCGCGTCTGC 
5761 GTCTGGCTGG CTGGCATAAA TATCTCACTC GCAATCAAAT TCAGCCGATA GCGGAACGGG 
5821 AAGGCGACTG GAGTGCCATG TCCGGTTTTC AACAAACCAT GCAAATGCTG AATGAGGGCA 
5881 TCGTTCCCAC TGCGATGCTG GTTGCCAACG ATCAGATGGC GCTGGGCGCA ATGCGCGCCA 
5941 TTACCGAGTC CGGGCTGCGC GTTGGTGCGG ATATCTCGGT AGTGGGATAC GACGATACCG 
6001 AAGACAGCTC ATGTTATATC CCGCCGTTAA CCACCATCAA ACAGGATTTT CGCCTGCTGG 
6061 GGCAAACCAG CGTGGACCGC TTGCTGCAAC TCTCTCAGGG CCAGGCGGTG AAGGGCAATC 
6121 AGCTGTTGCC CGTCTCACTG GTGAAAAGAA AAACCACCCT GGCGCCCAAT ACGCAAACCG 
6181 CCTCTCCCCG CGCGTTGGCC GATTCATTAA TGCAGCTGGC ACGACAGGTT TCCCGACTGG 
6241 AAAGCGGGCA GTGAGCGCAA CGCAATTAAT GTGAGTTAGC TCACTCATTA GGCACCCCAG 
6301 GCTTTACACT TTATGCTTCC GGCTCGTATG TTGTGTGGAA TTGTGAGCGG ATAACAATTT 
6361 CACACAGGAA ACAGCTATGA CCATGATTAC GGATTCACTG GCCGTCGTTT TACAACGTCG 
6421 TGACTGGGAA AACCCTGGCG TTACCCAACT TAATCGCCTT GCAGCACATC CCCCTTTCGC 
6481 CAGCTGGCGT AATAGCGAAG AGGCCCGCAC CGATCGCCCT TCCCAACAGT TGCGCAGCCT 
6541 GAATGGCGAA TGGCGCTTTG CCTGGTTTCC GGCACCAGAA GCGGTGCCGG AAAGCTGGCT 
6601 GGAGTGCGAT CTTCCTGAGG CCGATACTGT CGTCGTCCCC TCAAACTGGC AGATGCACGG 
6661 TTACGATGCG CCCATCTACA CCAACGTAAC CTATCCCATT ACGGTCAATC CGCCGTTTGT 
6721 TCCCACGGAG AATCCGACGG GTTGTTACTC GCTCACATTT AATGTTGATG AAAGCTGGCT 
6781 ACAGGAAGGC CAGACGCGAA TTATTTTTGA TGGCGTTGGA ATT 
pDEST4 6964 bp 

Location (Base Nos.)    Gene Encoded 

964..1003               Trc 
1577..1453              attR1 
1827..2486              CmR 
2606..2690              inactivated ccdA 
2828..3133              ccdB 
3174..3298              attR2 
3872..4777              ampR 
5378..5538              ori 
5778..6215              f1 ori (f1 intergenic region) 
6587..704               lacIq 



1 CTATCCGCTG GATGACCAGG ATGCCATTGC TGTGGAAGCT GCCTGCACTA ATGTTCCGGC 
61 GTTATTTCTT GATGTCTCTG ACCAGACACC CATCAACAGT ATTATTTTCT CCCATGAAGA 
121 CGGTACGCGA CTGGGCGTGG AGCATCTGGT CGCATTGGGT CACCAGCAAA TCGCGCTGTT 
181 AGCGGGCCCA TTAAGTTCTG TCTCGGCGCG TCTGCGTCTG GCTGGCTGGC ATAAATATCT 
241 CACTCGCAAT CAAATTCAGC CGATAGCGGA ACGGGAAGGC GACTGGAGTG CCATGTCCGG 
301 TTTTCAACAA ACCATGCAAA TGCTGAATGA GGGCATCGTT CCCACTGCGA TGCTGGTTGC 
361 CAACGATCAG ATGGCGCTGG GCGCAATGCG CGCCATTACC GAGTCCGGGC TGCGCGTTGG 
421 TGCGGATATC TCGGTAGTGG GATACGACGA TACCGAAGAC AGCTCATGTT ATATCCCGCC 
481 GTCAACCACC ATCAAACAGG ATTTTCGCCT GCTGGGGCAA ACCAGCGTGG ACCGCTTGCT 
541 GCAACTCTCT CAGGGCCAGG CGGTGAAGGG CAATCAGCTG TTGCCCGTCT CACTGGTGAA 
601 AAGAAAAACC ACCCTGGCAC CCAATACGCA AACCGCCTCT CCCCGCGCGT TGGCCGATTC 
661 ATTAATGCAG CTGGCACGAC AGGTTTCCCG ACTGGAAAGC GGGCAGTGAG CGCAACGCAA 
721 TTAATGTGAG TTAGCGCGAA TTGATCTGGT TTGACAGCTT ATCATCGACT GCACGGTGCA 
781 CCAATGCTTC TGGCGTCAGG CAGCCATCGG AAGCTGTGGT ATGGCTGTGC AGGTCGTAAA 
841 TCACTGCATA ATTCGTGTCG CTCAAGGCGC ACTCCCGTTC TGGATAATGT TTTTTGCGCC 
901 GACATCATAA CGGTTCTGGC AAATATTCTG AAATGAGCTG TTGACAATTA ATCATCCGGT 
961 CCGTATAATC TGTGGAATTG TGAGCGGATA ACAATTTCAC ACAGGAAACA GACCATGGGT 
1021 CATCATCATC ATCATCACGA TTACGATATC CCAACGACCG AAAACCTGTA TTTTCAGGGC 
1081 GCCCATATGA GCGATAAAAT TATTCACCTG ACTGACGACA GTTTTGACAC GGATGTACTC 
1141 AAAGCGGACG GGGCGATCCT CGTCGATTTC TGGGCAGAGT GGTGCGGTCC GTGCAAAATG 
1201 ATCGCCCCGA TTCTGGATGA AATCGCTGAC GAATATCAGG GCAAACTGAC CGTTGCAAAA 
1261 CTGAACATCG ATCAAAACCC TGGCACTGCG CCGAAATATG GCATCCGTGG TATCCCGACT 
1321 CTGCTGCTGT TCAAAAACGG TGAAGTGGCG GCAACCAAAG TGGGTGCACT GTCTAAAGGT 
1381 CAGTTGAAAG AGTTCCTCGA CGCTAACCTG GCCGGTTCTG GTTCTGGTGA TGACGATGAC 
1441 AAGGTACCCA TCACAAGTTT GTACAAAAAA GCTGAACGAG AAACGTAAAA TGATATAAAT 
1501 ATCAATATAT TAAATTAGAT TTTGCATAAA AAACAGACTA CATAATACTG TAAAACACAA 
1561 CATATCCAGT CACTATGGCG GCCGCTAAGT TGGCAGCATC ACCCGACGCA CTTTGCGCCG 
1621 AATAAATACC TGTGACGGAA GATCACTTCG CAGAATAAAT AAATCCTGGT GTCCCTGTTG 
1681 ATACCGGGAA GCCCTGGGCC AACTTTTGGC GAAAATGAGA CGTTGATCGG CACGTAAGAG 
1741 GTTCCAACTT TCACCATAAT GAAATAAGAT CACTACCGGG CGTATTTTTT GAGTTATCGA 
1801 GATTTTCAGG AGCTAAGGAA GCTAAAATGG AGAAAAAAAT CACTGGATAT ACCACCGTTG 
1861 ATATATCCCA ATGGCATCGT AAAGAACATT TTGAGGCATT TCAGTCAGTT GCTCAATGTA 
1921 CCTATAACCA GACCGTTCAG CTGGATATTA CGGCCTTTTT AAAGACCGTA AAGAAAAATA 
1981 AGCACAAGTT TTATCCGGCC TTTATTCACA TTCTTGCCCG CCTGATGAAT GCTCATCCGG 
2041 AATTCCGTAT GGCAATGAAA GACGGTGAGC TGGTGATATG GGATAGTGTT CACCCTTGTT 
2101 ACACCGTTTT CCATGAGCAA ACTGAAACGT TTTCATCGCT CTGGAGTGAA TACCACGACG 
2161 ATTTCCGGCA GTTTCTACAC ATATATTCGC AAGATGTGGC GTGTTACGGT GAAAACCTGG 
2221 CCTATTTCCC TAAAGGGTTT ATTGAGAATA TGTTTTTCGT CTCAGCCAAT CCCTGGGTGA 
2281 GTTTCACCAG TTTTGATTTA AACGTGGCCA ATATGGACAA CTTCTTCGCC CCCGTTTTCA 
2341 CCATGGGCAA ATATTATACG CAAGGCGACA AGGTGCTGAT GCCGCTGGCG ATTCAGGTTC 
2401 ATCATGCCGT CTGTGATGGC TTCCATGTCG GCAGAATGCT TAATGAATTA CAACAGTACT 
2461 GCGATGAGTG GCAGGGCGGG GCGTAAAGGC GTGGATCCGG CTTACTAAAA GCCAGATAAC 
2521 AGTATGCGTA TTTGCGCGCT GATTTTTGCG GTATAAGAAT ATATACTGAT ATGTATACCC 
2581 GAAGTATGTC AAAAAGAGGT GTGCTATGAA GCAGCGTATT ACAGTGACAG TTGACAGCGA 
2641 CAGCTATCAG TTGCTCAAGG CATATATGAT GTCAATATCT CCGGTCTGGT AAGCACAACC 
2701 ATGCAGAATG AAGCCCGTCG TCTGCGTGCC GAACGCTGGA AAGCGGAAAA TCAGGAAGGG 
2761 ATGGCTGAGG TCGCCCGGTT TATTGAAATG AACGGCTCTT TTGCTGACGA GAACAGGGAC 
2821 TGGTGAAATG CAGTTTAAGG TTTACACCTA TAAAAGAGAG AGCCGTTATC GTCTGTTTGT 
2881 GGATGTACAG AGTGATATTA TTGACACGCC CGGGCGACGG ATGGTGATCC CCCTGGCCAG 
2941 TGCACGTCTG CTGTCAGATA AAGTCTCCCG TGAACTTTAC CCGGTGGTGC ATATCGGGGA 
3001 TGAAAGCTGG CGCATGATGA CCACCGATAT GGCCAGTGTG CCGGTCTCCG TTATCGGGGA 
3061 AGAAGTGGCT GATCTCAGCC ACCGCGAAAA TGACATCAAA AACGCCATTA ACCTGATGTT 
3121 CTGGGGAATA TAAATGTCAG GCTCCCTTAT ACACAGCCAG TCTGCAGGTC GACCATAGTG 
3181 ACTGGATATG TTGTGTTTTA CAGTATTATG TAGTCTGTTT TTTATGCAAA ATCTAATTTA 
3241 ATATATTGAT ATTTATATCA TTTTACGTTT CTCGTTCAGC TTTCTTGTAC AAAGTGGTGA 
3301 TGGGGATCCT CTAGAGTCGA CCTGCAGTAA TCGTACAGGG TAGTACAAAT AAAAAAGGCA 
3361 CGTCAGATGA CGTGCCTTTT TTCTTGTGAG CAGTAAGCTT GGCTGTTTTG GCGGATGAGA 
3421 GAAGATTTTC AGCCTGATAC AGATTAAATC AGAACGCAGA AGCGGTCTGA TAAAACAGAA 
3481 TTTGCCTGGC GGCAGTAGCG CGGTGGTCCC ACCTGACCCC ATGCCGAACT CAGAAGTGAA 
3541 ACGCCGTAGC GCCGATGGTA GTGTGGGGTC TCCCCATGCG AGAGTAGGGA ACTGCCAGGC 
3601 ATCAAATAAA ACGAAAGGCT CAGTCGAAAG ACTGGGCCTT TCGTTTTATC TGTTGTTTGT 
3661 CGGTGAACGC TCTCCTGAGT AGGACAAATC CGCCGGGAGC GGATTTGAAC GTTGCGAAGC 
3721 AACGGCCCGG AGGGTGGCGG GCAGGACGCC CGCCATAAAC TGCCAGGCAT CAAATTAAGC 
3781 AGAAGGCCAT CCTGACGGAT GGCCTTTTTG CGTTTCTACA AACTCTTTTT GTTTATTTTT 
3841 CTAAATACAT TCAAATATGT ATCCGCTCAT GAGACAATAA CCCTGATAAA TGCTTCAATA 
3901 ATATTGAAAA AGGAAGAGTA TGAGTATTCA ACATTTCCGT GTCGCCCTTA TTCCCTTTTT 
3961 TGCGGCATTT TGCCTTCCTG TTTTTGCTCA CCCAGAAACG CTGGTGAAAG TAAAAGATGC 
4021 TGAAGATCAG TTGGGTGCAC GAGTGGGTTA CATCGAACTG GATCTCAACA GCGGTAAGAT 
4081 CCTTGAGAGT TTTCGCCCCG AAGAACGTTT TCCAATGATG AGCACTTTTA AAGTTCTGCT 
4141 ATGTGGCGCG GTATTATCCC GTGTTGACGC CGGGCAAGAG CAACTCGGTC GCCGCATACA 
4201 CTATTCTCAG AATGACTTGG TTGAGTACTC ACCAGTCACA GAAAAGCATC TTACGGATGG 
4261 CATGACAGTA AGAGAATTAT GCAGTGCTGC CATAACCATG AGTGATAACA CTGCGGCCAA 
4321 CTTACTTCTG ACAACGATCG GAGGACCGAA GGAGCTAACC GCTTTTTTGC ACAACATGGG 
4381 GGATCATGTA ACTCGCCTTG ATCGTTGGGA ACCGGAGCTG AATGAAGCCA TACCAAACGA 
4441 CGAGCGTGAC ACCACGATGC CTACAGCAAT GGCAACAACG TTGCGCAAAC TATTAACTGG 
4501 CGAACTACTT ACTCTAGCTT CCCGGCAACA ATTAATAGAC TGGATGGAGG CGGATAAAGT 
4561 TGCAGGACCA CTTCTGCGCT CGGCCCTTCC GGCTGGCTGG TTTATTGCTG ATAAATCTGG 
4621 AGCCGGTGAG CGTGGGTCTC GCGGTATCAT TGCAGCACTG GGGCCAGATG GTAAGCCCTC 
4681 CCGTATCGTA GTTATCTACA CGACGGGGAG TCAGGCAACT ATGGATGAAC GAAATAGACA 
4741 GATCGCTGAG ATAGGTGCCT CACTGATTAA GCATTGGTAA CTGTCAGACC AAGTTTACTC 
4801 ATATATACTT TAGATTGATT TAAAACTTCA TTTTTAATTT AAAAGGATCT AGGTGAAGAT 
4861 CCTTTTTGAT AATCTCATGA CCAAAATCCC TTAACGTGAG TTTTCGTTCC ACTGAGCGTC 
4921 AGACCCCGTA GAAAAGATCA AAGGATCTTC TTGAGATCCT TTTTTTCTGC GCGTAATCTG 
4981 CTGCTTGCAA ACAAAAAAAC CACCGCTACC AGCGGTGGTT TGTTTGCCGG ATCAAGAGCT 
5041 ACCAACTCTT TTTCCGAAGG TAACTGGCTT CAGCAGAGCG CAGATACCAA ATACTGTCCT 
5101 TCTAGTGTAG CCGTAGTTAG GCCACCACTT CAAGAACTCT GTAGCACCGC CTACATACCT 
5161 CGCTCTGCTA ATCCTGTTAC CAGTGGCTGC TGCCAGTGGC GATAAGTCGT GTCTTACCGG 
5221 GTTGGACTCA AGACGATAGT TACCGGATAA GGCGCAGCGG TCGGGCTGAA CGGGGGGTTC 
5281 GTGCACACAG CCCAGCTTGG AGCGAACGAC CTACACCGAA CTGAGATACC TACAGCGTGA 
5341 GCTATGAGAA AGCGCCACGC TTCCCGAAGG GAGAAAGGCG GACAGGTATC CGGTAAGCGG 
5401 CAGGGTCGGA ACAGGAGAGC GCACGAGGGA GCTTCCAGGG GGAAACGCCT GGTATCTTTA 
5461 TAGTCCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA TTTTTGTGAT GCTCGTCAGG 
5521 GGGGCGGAGC CTATGGAAAA ACGCCAGCAA CGCGGCCTTT TTACGGTTCC TGGCCTTTTG 
5581 CTGGCCTTTT GCTCACATGT TCTTTCCTGC GTTATCCCCT GATTCTGTGG ATAACCGTAT 
5641 TACCGCCTTT GAGTGAGCTG ATACCGCTCG CCGCAGCCGA ACGACCGAGC GCAGCGAGTC 
5701 AGTGAGCGAG GAAGCGGAAG AGCGCCTGAT GCGGTATTTT CTCCTTACGC ATCTGTGCGG 
5761 TATTTCACAC CGCATAATTT TGTTAAAATT CGCGTTAAAT TTTTGTTAAA TCAGCTCATT 
5821 TTTTAACCAA TAGGCCGAAA TCGGCAAAAT CCCTTATAAA TCAAAAGAAT AGACCGAGAT 
5881 AGGGTTGAGT GTTGTTCCAG TTTGGAACAA GAGTCCACTA TTAAAGAACG TGGACTCCAA 
5941 CGTCAAAGGG CGAAAAACCG TCTATCAGGG CGATGGCCCA CTACGTGAAC CATCACCCTA 
6001 ATCAAGTTTT TTGGGGTCGA GGTGCCGTAA AGCACTAAAT CGGAACCCTA AAGGGAGCCC 
6061 CCGATTTAGA GCTTGACGGG GAAAGCCGGC GAACGTGGCG AGAAAGGAAG GGAAGAAAGC 
6121 GAAAGGAGCG GGCGCTAGGG CGCTGGCAAG TGTAGCGGTC ACGCTGCGCG TAACCACCAC 
6181 ACCCGCCGCG CTTAATGCGC CGCTACAGGG CGCGTCCATT CGCCATTCAG GCTGCTATGG 
6241 TGCACTCTCA GTACAATCTG CTCTGATGGC GCATAGTTAA GCCAGTATAC ACTCCGCTAT 
6301 CGCTACGTGA CTGGGTCATG GCTGCGCCCC GACACCCGCC AACACCCGCT GACGCGCCCT 
6361 GACGGGCTTG TCTGCTCCCG GCATCCGCTT ACAGACAAGC TGTGACCGTC TCCGGGAGCT 
6421 GCATGTGTCA GAGGTTTTCA CCGTCATCAC CGAAACGCGC GAGGCAGCAG ATCAATTCGC 
6481 GCGCGAAGGC GAAGCGGCAT GCATTTACGT TGACACCATC GAATGGTGCA AAACCTTTCG 
6541 CGGTATGGCA TGATAGCGCC CGGAAGAGAG TCAATTCAGG GTGGTGAATG TGAAACCAGT 
6601 AACGTTATAC GATGTCGCAG AGTATGCCGG TGTCTCTTAT CAGACCGTTT CCCGCGTGGT 
6661 GAACCAGGCC AGCCACGTTT CTGCGAAAAC GCGGGAAAAA GTGGAAGCGG CGATGGCGGA 
6721 GCTGAATTAC ATTCCCAACC GCGTGGCACA ACAACTGGCG GGCAAACAGT CGTTGCTGAT 
6781 TGGCGTTGCC ACCTCCAGTC TGGCCCTGCA CGCGCCGTCG CAAATTGTCG CGGCGATTAA 
6841 ATCTCGCGCC GATCAACTGG GTGCCAGCGT GGTGGTGTCG ATGGTAGAAC GAAGCGGCGT 
6901 CGAAGCCTGT AAAGCGGCGG TGCACAATCT TCTCGCGCAA CGCGTCAGTN GGGCTGATCA 
6961 TTAA 
pSPORT V (FOR SEQUENCING, PROBES, PHAGEMID) 

lac promoter (-35, -10); lac RNA 
1 agg cac ccc agg ctt tac act tta tgc ttc cgg ctc gta tgt tgt gtg gaa 
  tcc gtg ggg tcc gaa atg tga aat acg aag gcc gag cat aca aca cac ctt 

"reverse" sequencing primers; α-peptide 
52 ttg tga gcg gat aac aat ttc aca cag gaa aca gct atg acc atg att acg 
   aac act cgc cta ttg tta aag tgt gtc ctt tgt cga tac tgg tac taa tgc 

T7 Promoter; T7 RNA; Pst; Kpn 
103 cca agc tct aat acg act cac tat agg gaa agc tgg tac gcc tgc agg tac 
    ggt tcg aga tta tgc tga gtg ata tcc ctt tcg acc atg cgg acg tcc atg 

EcoRI; Sma; Sal; Int; attR1 
154 cgg tcc gga att ccc ggg tcg acg atc aca agt ttg tac aaa ... 
    gcc agg cct taa ggg ccc agc tgc tag tgt tca aac atg ttt ... 

Gene 

Int; attR2; Spe 
1990 ... ct tgt aca aag tgg tga tca cta gtc ggc 
     ... ga aca tgt ttc acc act agt gat cag ccg 

Not; Xba; Bam; HindIII; Mlu; Sph 
2041 ggc cgc tct aga gga tcc aag ctt acg tac gcg tgc atg cga cgt cat agc 
     ccg gcg aga tct cct agg ttc gaa tgc atg cgc acg tac gct gca gta tcg 

SP6 Promoter; SP6 RNA 
2092 tct tct ata gtg tca cct aaa ttc aat tca ctg gcc gtc gtt tta caa cgt 
     aga aga tat cac agt gga ttt aag tta agt gac cgg cag caa aat gtt gca 

"forward" sequencing primers 
2143 cgt gac tgg gaa aac cct ggc gtt acc caa ctt aat cgc ctt gca gca cat 
     gca ctg acc ctt ttg gga ccg caa tgg gtt gaa tta gcg gaa cgt cgt gta 
pDEST5 

PflMI, BspLU11I, ori, AlwNI 

Sse8387I, PstI, BspMI, Asp718I, KpnI, AgeI, RsrII, BspEI, EcoRI, AvaI, SmaI, XmaI, AccI, SalI 

AhdI, BsaI 

SpeI, NotI, XmaIII, XbaI, BamHI, HindIII, SnaBI, SplI, MluI, SphI, AatII 
pDEST5 5957 bp 

Location (Base Nos.)    Gene Encoded 

305..181                attR1 
555..1214               CmR 
1334..1418              inactivated ccdA 
1556..1861              ccdB 
1902..2026              attR2 
2278..2733              f1 (f1 intergenic region) 
2865..3722              ampR 
5378..5538              ori 
4756..5922              lacI 



1 AGGCACCCCA GGCTTTACAC TTTATGCTTC CGGCTCGTAT GTTGTGTGGA ATTGTGAGCG 
61 GATAACAATT TCACACAGGA AACAGCTATG ACCATGATTA CGCCAAGCTC TAATACGACT 
121 CACTATAGGG AAAGCTGGTA CGCCTGCAGG TACCGGTCCG GAATTCCCGG GTCGACGATC 
181 ACAAGTTTGT ACAAAAAAGC TGAACGAGAA ACGTAAAATG ATATAAATAT CAATATATTA 
241 AATTAGATTT TGCATAAAAA ACAGACTACA TAATACTGTA AAACACAACA TATCCAGTCA 
301 CTATGGCGGC CGCTAAGTTG GCAGCATCAC CCGACGCACT TTGCGCCGAA TAAATACCTG 
361 TGACGGAAGA TCACTTCGCA GAATAAATAA ATCCTGGTGT CCCTGTTGAT ACCGGGAAGC 
421 CCTGGGCCAA CTTTTGGCGA AAATGAGACG TTGATCGGCA CGTAAGAGGT TCCAACTTTC 
481 ACCATAATGA AATAAGATCA CTACCGGGCG TATTTTTTGA GTTATCGAGA TTTTCAGGAG 
541 CTAAGGAAGC TAAAATGGAG AAAAAAATCA CTGGATATAC CACCGTTGAT ATATCCCAAT 
601 GGCATCGTAA AGAACATTTT GAGGCATTTC AGTCAGTTGC TCAATGTACC TATAACCAGA 
661 CCGTTCAGCT GGATATTACG GCCTTTTTAA AGACCGTAAA GAAAAATAAG CACAAGTTTT 
721 ATCCGGCCTT TATTCACATT CTTGCCCGCC TGATGAATGC TCATCCGGAA TTCCGTATGG 
781 CAATGAAAGA CGGTGAGCTG GTGATATGGG ATAGTGTTCA CCCTTGTTAC ACCGTTTTCC 
841 ATGAGCAAAC TGAAACGTTT TCATCGCTCT GGAGTGAATA CCACGACGAT TTCCGGCAGT 
901 TTCTACACAT ATATTCGCAA GATGTGGCGT GTTACGGTGA AAACCTGGCC TATTTCCCTA 
961 AAGGGTTTAT TGAGAATATG TTTTTCGTCT CAGCCAATCC CTGGGTGAGT TTCACCAGTT 
1021 TTGATTTAAA CGTGGCCAAT ATGGACAACT TCTTCGCCCC CGTTTTCACC ATGGGCAAAT 
1081 ATTATACGCA AGGCGACAAG GTGCTGATGC CGCTGGCGAT TCAGGTTCAT CATGCCGTCT 
1141 GTGATGGCTT CCATGTCGGC AGAATGCTTA ATGAATTACA ACAGTACTGC GATGAGTGGC 
1201 AGGGCGGGGC GTAAACGCGT GGATCCGGCT TACTAAAAGC CAGATAACAG TATGCGTATT 
1261 TGCGCGCTGA TTTTTGCGGT ATAAGAATAT ATACTGATAT GTATACCCGA AGTATGTCAA 
1321 AAAGAGGTGT GCTATGAAGC AGCGTATTAC AGTGACAGTT GACAGCGACA GCTATCAGTT 
1381 GCTCAAGGCA TATATGATGT CAATATCTCC GGTCTGGTAA GCACAACCAT GCAGAATGAA 
1441 GCCCGTCGTC TGCGTGCCGA ACGCTGGAAA GCGGAAAATC AGGAAGGGAT GGCTGAGGTC 
1501 GCCCGGTTTA TTGAAATGAA CGGCTCTTTT GCTGACGAGA ACAGGGACTG GTGAAATGCA 
1561 GTTTAAGGTT TACACCTATA AAAGAGAGAG CCGTTATCGT CTGTTTGTGG ATGTACAGAG 
1621 TGATATTATT GACACGCCCG GGCGACGGAT GGTGATCCCC CTGGCCAGTG CACGTCTGCT 
1681 GTCAGATAAA GTCTCCCGTG AACTTTACCC GGTGGTGCAT ATCGGGGATG AAAGCTGGCG 
1741 CATGATGACC ACCGATATGG CCAGTGTGCC GGTCTCCGTT ATCGGGGAAG AAGTGGCTGA 
1801 TCTCAGCCAC CGCGAAAATG ACATCAAAAA CGCCATTAAC CTGATGTTCT GGGGAATATA 
1861 AATGTCAGGC TCCCTTATAC ACAGCCAGTC TGCAGGTCGA CCATAGTGAC TGGATATGTT 
1921 GTGTTTTACA GTATTATGTA GTCTGTTTTT TATGCAAAAT CTAATTTAAT ATATTGATAT 
1981 TTATATCATT TTACGTTTCT CGTTCAGCTT TCTTGTACAA AGTGGTGATC ACTAGTCGGC 
2041 GGCCGCTCTA GAGGATCCAA GCTTACGTAC GCGTGCATGC GACGTCATAG CTCTTCTATA 
2101 GTGTCACCTA AATTCAATTC ACTGGCCGTC GTTTTACAAC GTCGTGACTG GGAAAACCCT 
2161 GGCGTTACCC AACTTAATCG CCTTGCAGCA CATCCCCCTT TCGCCAGCTG GCGTAATAGC 
2221 GAAGAGGCCC GCACCGATCG CCCTTCCCAA CAGTTGCGCA GCCTGAATGG CGAATGGACG 
2281 CGCCCTGTAG CGGCGCATTA AGCGCGGCGG GTGTGGTGGT TACGCGCAGC GTGACCGCTA 
2341 CACTTGCCAG CGCCCTAGCG CCCGCTCCTT TCGCTTTCTT CCCTTCCTTT CTCGCCACGT 
2401 TCGCCGGCTT TCCCCGTCAA GCTCTAAATC GGGGGCTCCC TTTAGGGTTC CGATTTAGTG 
2461 CTTTACGGCA CCTCGACCCC AAAAAACTTG ATTAGGGTGA TGGTTCACGT AGTGGGCCAT 
2521 CGCCCTGATA GACGGTTTTT CGCCCTTTGA CGTTGGAGTC CACGTTCTTT AATAGTGGAC 
2581 TCTTGTTCCA AACTGGAACA ACACTCAACC CTATCTCGGT CTATTCTTTT GATTTATAAG 
2641 GGATTTTGCC GATTTCGGCC TATTGGTTAA AAAATGAGCT GATTTAACAA AAATTTAACG 
2701 CGAATTTTAA CAAAATATTA ACGTTTACAA TTTCAGGTGG CACTTTTCGG GGAAATGTGC 
2761 GCGGAACCCC TATTTGTTTA TTTTTCTAAA TACATTCAAA TATGTATCCG CTCATGAGAC 
2821 AATAACCCTG ATAAATGCTT CAATAATATT GAAAAAGGAA GAGTATGAGT ATTCAACATT 
2881 TCCGTGTCGC CCTTATTCCC TTTTTTGCGG CATTTTGCCT TCCTGTTTTT GCTCACCCAG 
2941 AAACGCTGGT GAAAGTAAAA GATGCTGAAG ATCAGTTGGG TGCACGAGTG GGTTACATCG 
3001 AACTGGATCT CAACAGCGGT AAGATCCTTG AGAGTTTTCG CCCCGAAGAA CGTTTTCCAA 
3061 TGATGAGCAC TTTTAAAGTT CTGCTATGTG GCGCGGTATT ATCCCGTATT GACGCCGGGC 
3121 AAGAGCAACT CGGTCGCCGC ATACACTATT CTCAGAATGA CTTGGTTGAG TACTCACCAG 
3181 TCACAGAAAA GCATCTTACG GATGGCATGA CAGTAAGAGA ATTATGCAGT GCTGCCATAA 
3241 CCATGAGTGA TAACACTGCG GCCAACTTAC TTCTGACAAC GATCGGAGGA CCGAAGGAGC 
3301 TAACCGCTTT TTTGCACAAC ATGGGGGATC ATGTAACTCG CCTTGATCGT TGGGAACCGG 
3361 AGCTGAATGA AGCCATACCA AACGACGAGC GTGACACCAC GATGCCTGTA GCAATGGCAA 
3421 CAACGTTGCG CAAACTATTA ACTGGCGAAC TACTTACTCT AGCTTCCCGG CAACAATTAA 
3481 TAGACTGGAT GGAGGCGGAT AAAGTTGCAG GACCACTTCT GCGCTCGGCC CTTCCGGCTG 
3541 GCTGGTTTAT TGCTGATAAA TCTGGAGCCG GTGAGCGTGG GTCTCGCGGT ATCATTGCAG 
3601 CACTGGGGCC AGATGGTAAG CCCTCCCGTA TCGTAGTTAT CTACACGACG GGGAGTCAGG 
3661 CAACTATGGA TGAACGAAAT AGACAGATCG CTGAGATAGG TGCCTCACTG ATTAAGCATT 
3721 GGTAACTGTC AGACCAAGTT TACTCATATA TACTTTAGAT TGATTTAAAA CTTCATTTTT 
3781 AATTTAAAAG GATCTAGGTG AAGATCCTTT TTGATAATCT CATGACCAAA ATCCCTTAAC 
3841 GTGAGTTTTC GTTCCACTGA GCGTCAGACC CCGTAGAAAA GATCAAAGGA TCTTCTTGAG 
3901 ATCCTTTTTT TCTGCGCGTA ATCTGCTGCT TGCAAACAAA AAAACCACCG CTACCAGCGG 
3961 TGGTTTGTTT GCCGGATCAA GAGCTACCAA CTCTTTTTCC GAAGGTAACT GGCTTCAGCA 
4021 GAGCGCAGAT ACCAAATACT GTCCTTCTAG TGTAGCCGTA GTTAGGCCAC CACTTCAAGA 
4081 ACTCTGTAGC ACCGCCTACA TACCTCGCTC TGCTAATCCT GTTACCAGTG GCTGCTGCCA 
4141 GTGGCGATAA GTCGTGTCTT ACCGGGTTGG ACTCAAGACG ATAGTTACCG GATAAGGCGC 
4201 AGCGGTCGGG CTGAACGGGG GGTTCGTGCA CACAGCCCAG CTTGGAGCGA ACGACCTACA 
4261 CCGAACTGAG ATACCTACAG CGTGAGCATT GAGAAAGCGC CACGCTTCCC GAAGGGAGAA 
4321 AGGCGGACAG GTATCCGGTA AGCGGCAGGG TCGGAACAGG AGAGCGCACG AGGGAGCTTC 
4381 CAGGGGGAAA CGCCTGGTAT CTTTATAGTC CTGTCGGGTT TCGCCACCTC TGACTTGAGC 
4441 GTCGATTTTT GTGATGCTCG TCAGGGGGGC GGAGCCTATG GAAAAACGCC AGCAACGCGG 
4501 CCTTTTTACG GTTCCTGGCC TTTTGCTGGC CTTTTGCTCA CATGTTCTTT CCTGCGTTAT 
4561 CCCCTGATTC TGTGGATAAC CGTATTACCG CCTTTGAGTG AGCTGATACC GCTCGCCGCA 
4621 GCCGAACGAC CGAGCGCAGC GAGTCAGTGA GCGAGGAAGC GGAAGAGCGC CCAATACGCA 
4681 AACCGCCTCT CCCCGCGCGT TGGCCGATTC ATTAATGCAG AGCTTGCAAT TCGCGCGCGA 
4741 AGGCGAAGCG GCATTTACGT TGACACCATC GAATGGCGCA AAACCTTTCG CGGTATGGCA 
4801 TGATAGCGCC CGGAAGAGAG TCAATTCAGG GTGGTGAATG TGAAACCAGT AACGTTATAC 
4861 GATGTCGCAG AGTATGCCGG TGTCTCTTAT CAGACCGTTT CCCGCGTGGT GAACCAGGCC 
4921 AGCCACGTTT CTGCGAAAAC GCGGGAAAAA GTGGAAGCGG CGATGGCGGA GCTGAATTAC 
4981 ATTCCCAACC GCGTGGCACA ACAACTGGCG GGCAAACAGT CGTTGCTGAT TGGCGTTGCC 
5041 ACCTCCAGTC TGGCCCTGCA CGCGCCGTCG CAAATTGTCG CGGCGATTAA ATCTCGCGCC 
5101 GATCAACTGG GTGCCAGCGT GGTGGTGTCG ATGGTAGAAC GAAGCGGCGT CGAAGCCTGT 
5161 AAAGCGGCGG TGCACAATCT TCTCGCGCAA CGGGTCAGTG GGCTGATCAT TAACTATCCG 
5221 CTGGATGACC AGGATGCCAT TGCTGTGGAA GCTGCCTGCA CTAATGTTCC GGCGTTATTT 
5281 CTTGATGTCT CTGACCAGAC ACCCATCAAC AGTATTATTT TCTCCCATGA AGACGGTACG 
5341 CGACTGGGCG TGGAGCATCT GGTCGCATTG GGTCACCAGC AAATCGCGCT GTTAGCGGGC 
5401 CCATTAAGTT CTGTCTCGGC GCGTCTGCGT CTGGCTGGCT GGCATAAATA TCTCACTCGC 
5461 AATCAAATTC AGCCGATAGC GGAACGGGAA GGCGACTGGA GTGCCATGTC CGGTTTTCAA 
5521 CAAACCATGC AAATGCTGAA TGAGGGCATC GTTCCCACTG CGATGCTGGT TGCCAACGAT 
5581 CAGATGGCGC TGGGCGCAAT GCGCGCCATT ACCGAGTCCG GGCTGCGCGT TGGTGCGGAT 
5641 ATCTCGGTAG TGGGATACGA CGATACCGAA GACAGCTCAT GTTATATCCC GCCGTCAACC 
5701 ACCATCAAAC AGGATTTTCG CCTGCTGGGG CAAACCAGCG TGGACCGCTT GCTGCAACTC 
5761 TCTCAGGGCC AGGCGGTGAA GGGCAATCAG CTGTTGCCCG TCTCACTGGT GAAAAGAAAA 
5821 ACCACCCTGG CGCCCAATAC GCAAACCGCC TCTCCCCGCG CGTTGGCCGA TTCATTAATG 
5881 CAGCTGGCAC GACAGGTTTC CCGACTGGAA AGCGGGCAGT GAGCGCAACG CAATTAATGT 
5941 GAGTTAGCTC ACTCATT 
pSPORT (OPPOSITE STRAND) 

"FORWARD" SEQUENCING PRIMERS 
1 taa cgc cag ggt ttt ccc agt cac gac gtt gta aaa cga cgg cca gtg aat 
  att gcg gtc cca aaa ggg tca gtg ctg caa cat ttt gct gcc ggt cac tta 

SP6 PROMOTER; Sph; Mlu 
52 tga att tag gtg aca cta tag aag agc tat gac gtc gca tgc acg cgt acg 
   act taa atc cac tgt gat atc ttc tcg ata ctg cag cgt acg tgc gca tgc 

Hind 3; Bam; Xba; Not; Spe; attR1; Int 
103 taa gct tgg atc ctc tag agc ggc cgc cga cta gtg atc aca agt ttg tac 
    att cga acc tag gag atc tcg ccg gcg gct gat cac tag tgt tca aac atg 

Int; attR2 
... t tgt aca aag tgg tga 
... a aca tgt ttc acc act 

Sal; Sma; EcoRI; Kpn; Pst; T7 RNA 
1990 tcg tcg acc cgg gaa ttc cgg acc ggt acc tgc agg cgt acc agc ttt ccc 
     agc agc tgg gcc ctt aag gcc tgg cca tgg acg tcc gca tgg tcg aaa ggg 

T7 PROMOTER; α-peptide 
2041 tat agt gag tcg tat tag agc ttg gcg taa tca tgg tca tag ctg ttt cct 
     ata tca ctc agc ata atc tcg aac cgc att agt acc agt atc gac aaa gga 

"REVERSE" SEQUENCING PRIMERS; lac RNA; -10 
2092 gtg tga aat tgt tat ccg ctc aca att cca cac aac ata cga gcc gga agc 
     cac act tta aca ata ggc gag tgt taa ggt gtg ttg tat gct cgg cct tcg 

lac PROMOTER; -35 
2143 ata aag tgt aaa gcc tgg ggt gcc taa tga gtg agc taa ctc aca tta att 
     tat ttc aca ttt cgg acc cca cgg att act cac tcg att gag tgt aat taa 
AatII 
SphI 
MluI 

FIG. 26B 



pDEST6 5957 bp 

Location (Base Nos.)    Gene Encoded 

266..142                attR1 
516..1175               CmR 
1295..1379              inactivated ccdA 
1517..1822              ccdB 
1863..1987              attR2 
2203..3369              lacI 
4403..5260              ampR 
5392..5847              f1 (f1 intergenic region) 



1 TAACGCCAGG GTTTTCCCAG TCACGACGTT GTAAAACGAC GGCCAGTGAA TTGAATTTAG 
61 GTGACACTAT AGAAGAGCTA TGACGTCGCA TGCACGCGTA CGTAAGCTTG GATCCTCTAG 
121 AGCGGCCGCC GACTAGTGAT CACAAGTTTG TACAAAAAAG CTGAACGAGA AACGTAAAAT 
181 GATATAAATA TCAATATATT AAATTAGATT TTGCATAAAA AACAGACTAC ATAATACTGT 
241 AAAACACAAC ATATCCAGTC ACTATGGCGG CCGCTAAGTT GGCAGCATCA CCCGACGCAC 
301 TTTGCGCCGA ATAAATACCT GTGACGGAAG ATCACTTCGC AGAATAAATA AATCCTGGTG 
361 TCCCTGTTGA TACCGGGAAG CCCTGGGCCA ACTTTTGGCG AAAATGAGAC GTTGATCGGC 
421 ACGTAAGAGG TTCCAACTTT CACCATAATG AAATAAGATC ACTACCGGGC GTATTTTTTG 
481 AGTTATCGAG ATTTTCAGGA GCTAAGGAAG CTAAAATGGA GAAAAAAATC ACTGGATATA 
541 CCACCGTTGA TATATCCCAA TGGCATCGTA AAGAACATTT TGAGGCATTT CAGTCAGTTG 
601 CTCAATGTAC CTATAACCAG ACCGTTCAGC TGGATATTAC GGCCTTTTTA AAGACCGTAA 
661 AGAAAAATAA GCACAAGTTT TATCCGGCCT TTATTCACAT TCTTGCCCGC CTGATGAATG 
721 CTCATCCGGA ATTCCGTATG GCAATGAAAG ACGGTGAGCT GGTGATATGG GATAGTGTTC 
781 ACCCTTGTTA CACCGTTTTC CATGAGCAAA CTGAAACGTT TTCATCGCTC TGGAGTGAAT 
841 ACCACGACGA TTTCCGGCAG TTTCTACACA TATATTCGCA AGATGTGGCG TGTTACGGTG 
901 AAAACCTGGC CTATTTCCCT AAAGGGTTTA TTGAGAATAT GTTTTTCGTC TCAGCCAATC 
961 CCTGGGTGAG TTTCACCAGT TTTGATTTAA ACGTGGCCAA TATGGACAAC TTCTTCGCCC 
1021 CCGTTTTCAC CATGGGCAAA TATTATACGC AAGGCGACAA GGTGCTGATG CCGCTGGCGA 
1081 TTCAGGTTCA TCATGCCGTC TGTGATGGCT TCCATGTCGG CAGAATGCTT AATGAATTAC 
1141 AACAGTACTG CGATGAGTGG CAGGGCGGGG CGTAAACGCG TGGATCCGGC TTACTAAAAG 
1201 CCAGATAACA GTATGCGTAT TTGCGCGCTG ATTTTTGCGG TATAAGAATA TATACTGATA 
1261 TGTATACCCG AAGTATGTCA AAAAGAGGTG TGCTATGAAG CAGCGTATTA CAGTGACAGT 
1321 TGACAGCGAC AGCTATCAGT TGCTCAAGGC ATATATGATG TCAATATCTC CGGTCTGGTA 
1381 AGCACAACCA TGCAGAATGA AGCCCGTCGT CTGCGTGCCG AACGCTGGAA AGCGGAAAAT 
1441 CAGGAAGGGA TGGCTGAGGT CGCCCGGTTT ATTGAAATGA ACGGCTCTTT TGCTGACGAG 
1501 AACAGGGACT GGTGAAATGC AGTTTAAGGT TTACACCTAT AAAAGAGAGA GCCGTTATCG 
1561 TCTGTTTGTG GATGTACAGA GTGATATTAT TGACACGCCC GGGCGACGGA TGGTGATCCC 
1621 CCTGGCCAGT GCACGTCTGC TGTCAGATAA AGTCTCCCGT GAACTTTACC CGGTGGTGCA 
1681 TATCGGGGAT GAAAGCTGGC GCATGATGAC CACCGATATG GCCAGTGTGC CGGTCTCCGT 
1741 TATCGGGGAA GAAGTGGCTG ATCTCAGCCA CCGCGAAAAT GACATCAAAA ACGCCATTAA 
1801 CCTGATGTTC TGGGGAATAT AAATGTCAGG CTCCCTTATA CACAGCCAGT CTGCAGGTCG
1861 ACCATAGTGA CTGGATATGT TGTGTTTTAC AGTATTATGT AGTCTGTTTT TTATGCAAAA
1921 TCTAATTTAA TATATTGATA TTTATATCAT TTTACGTTTC TCGTTCAGCT TTCTTGTACA 
1981 AAGTGGTGAT CGTCGACCCG GGAATTCCGG ACCGGTACCT GCAGGCGTAC CAGCTTTCCC 
2041 TATAGTGAGT CGTATTAGAG CTTGGCGTAA TCATGGTCAT AGCTGTTTCC TGTGTGAAAT 
2101 TGTTATCCGC TCACAATTCC ACACAACATA CGAGCCGGAA GCATAAAGTG TAAAGCCTGG 
2161 GGTGCCTAAT GAGTGAGCTA ACTCACATTA ATTGCGTTGC GCTCACTGCC CGCTTTCCAG 
2221 TCGGGAAACC TGTCGTGCCA GCTGCATTAA TGAATCGGCC AACGCGCGGG GAGAGGCGGT 
2281 TTGCGTATTG GGCGCCAGGG TGGTTTTTCT TTTCACCAGT GAGACGGGCA ACAGCTGATT 
2341 GCCCTTCACC GCCTGGCCCT GAGAGAGTTG CAGCAAGCGG TCCACGCTGG TTTGCCCCAG 
2401 CAGGCGAAAA TCCTGTTTGA TGGTGGTTGA CGGCGGGATA TAACATGAGC TGTCTTCGGT 
2461 ATCGTCGTAT CCCACTACCG AGATATCCGC ACCAACGCGC AGCCCGGACT CGGTAATGGC 
2521 GCGCATTGCG CCCAGCGCCA TCTGATCGTT GGCAACCAGC ATCGCAGTGG GAACGATGCC 
2581 CTCATTCAGC ATTTGCATGG TTTGTTGAAA ACCGGACATG GCACTCCAGT CGCCTTCCCG 
2641 TTCCGCTATC GGCTGAATTT GATTGCGAGT GAGATATTTA TGCCAGCCAG CCAGACGCAG 
2701 ACGCGCCGAG ACAGAACTTA ATGGGCCCGC TAACAGCGCG ATTTGCTGGT GACCCAATGC 
2761 GACCAGATGC TCCACGCCCA GTCGCGTACC GTCTTCATGG GAGAAAATAA TACTGTTGAT 
2821 GGGTGTCTGG TCAGAGACAT CAAGAAATAA CGCCGGAACA TTAGTGCAGG CAGCTTCCAC 
2881 AGCAATGGCA TCCTGGTCAT CCAGCGGATA GTTAATGATC AGCCCACTGA CCCGTTGCGC 
2941 GAGAAGATTG TGCACCGCCG CTTTACAGGC TTCGACGCCG CTTCGTTCTA CCATCGACAC 
3001 CACCACGCTG GCACCCAGTT GATCGGCGCG AGATTTAATC GCCGCGACAA TTTGCGACGG 
3061 CGCGTGCAGG GCCAGACTGG AGGTGGCAAC GCCAATCAGC AACGACTGTT TGCCCGCCAG 
3121 TTGTTGTGCC ACGCGGTTGG GAATGTAATT CAGCTCCGCC ATCGCCGCTT CCACTTTTTC 
3181 CCGCGTTTTC GCAGAAACGT GGCTGGCCTG GTTCACCACG CGGGAAACGG TCTGATAAGA 
3241 GACACCGGCA TACTCTGCGA CATCGTATAA CGTTACTGGT TTCACATTCA CCACCCTGAA 
3301 TTGACTCTCT TCCGGGCGCT ATCATGCCAT ACCGCGAAAG GTTTTGCGCC ATTCGATGGT 
3361 GTCAACGTAA ATGCCGCTTC GCCTTCGCGC GCGAATTGCA AGCTCTGCAT TAATGAATCG 
3421 GCCAACGCGC GGGGAGAGGC GGTTTGCGTA TTGGGCGCTC TTCCGCTTCC TCGCTCACTG 
3481 ACTCGCTGCG CTCGGTCGTT CGGCTGCGGC GAGCGGTATC AGCTCACTCA AAGGCGGTAA 
3541 TACGGTTATC CACAGAATCA GGGGATAACG CAGGAAAGAA CATGTGAGCA AAAGGCCAGC
3601 AAAAGGCCAG GAACCGTAAA AAGGCCGCGT TGCTGGCGTT TTTCCATAGG CTCCGCCCCC 
3661 CTGACGAGCA TCACAAAAAT CGACGCTCAA GTCAGAGGTG GCGAAACCCG ACAGGACTAT 
3721 AAAGATACCA GGCGTTTCCC CCTGGAAGCT CCCTCGTGCG CTCTCCTGTT CCGACCCTGC 
3781 CGCTTACCGG ATACCTGTCC GCCTTTCTCC CTTCGGGAAG CGTGGCGCTT TCTCAATGCT 
3841 CACGCTGTAG GTATCTCAGT TCGGTGTAGG TCGTTCGCTC CAAGCTGGGC TGTGTGCACG 
3901 AACCCCCCGT TCAGCCCGAC CGCTGCGCCT TATCCGGTAA CTATCGTCTT GAGTCCAACC 
3961 CGGTAAGACA CGACTTATCG CCACTGGCAG CAGCCACTGG TAACAGGATT AGCAGAGCGA 
4021 GGTATGTAGG CGGTGCTACA GAGTTCTTGA AGTGGTGGCC TAACTACGGC TACACTAGAA 
4081 GGACAGTATT TGGTATCTGC GCTCTGCTGA AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA 
4141 GCTCTTGATC CGGCAAACAA ACCACCGCTG GTAGCGGTGG TTTTTTTGTT TGCAAGCAGC 
4201 AGATTACGGG CAGAAAAAAA GGATCTCAAG AAGATCCTTT GATCTTTTCT ACGGGGTCTG 
4261 ACGCTCAGTG GAACGAAAAC TCACGTTAAG GGATTTTGGT CATGAGATTA TCAAAAAGGA
4321 TCTTCACCTA GATCCTTTTA AATTAAAAAT GAAGTTTTAA ATCAATCTAA AGTATATATG
4381 AGTAAACTTG GTCTGACAGT TACCAATGCT TAATCAGTGA GGCACCTATC TCAGCGATCT 
4441 GTCTATTTCG TTCATCCATA GTTGCCTGAC TCCCCGTCGT GTAGATAACT ACGATACGGG 
4501 AGGGCTTACC ATCTGGCCCC AGTGCTGCAA TGATACCGCG AGACCCACGC TCACCGGCTC 
4561 CAGATTTATC AGCAATAAAC CAGCCAGCCG GAAGGGCCGA GCGCAGAAGT GGTCCTGCAA 
4621 CTTTATCCGC CTCCATCCAG TCTATTAATT GTTGCCGGGA AGCTAGAGTA AGTAGTTCGC 
4681 CAGTTAATAG TTTGCGCAAC GTTGTTGCCA TTGCTACAGG CATCGTGGTG TCACGCTCGT 
4741 CGTTTGGTAT GGCTTCATTC AGCTCCGGTT CCCAACGATC AAGGCGAGTT ACATGATCCC 
4801 CCATGTTGTG CAAAAAAGCG GTTAGCTCCT TCGGTCCTCC GATCGTTGTC AGAAGTAAGT 
4861 TGGCCGCAGT GTTATCACTC ATGGTTATGG CAGCACTGCA TAATTCTCTT ACTGTCATGC 
4921 CATCCGTAAG ATGCTTTTCT GTGACTGGTG AGTACTCAAC CAAGTCATTC TGAGAATAGT 
4981 GTATGCGGCG ACCGAGTTGC TCTTGCCCGG CGTCAATACG GGATAATACC GCGCCACATA 
5041 GCAGAACTTT AAAAGTGCTC ATCATTGGAA AACGTTCTTC GGGGCGAAAA CTCTCAAGGA 
5101 TCTTACCGCT GTTGAGATCC AGTTCGATGT AACCCACTCG TGCACCCAAC TGATCTTCAG 
5161 CATCTTTTAC TTTCACCAGC GTTTCTGGGT GAGCAAAAAC AGGAAGGCAA AATGCCGCAA 
5221 AAAAGGGAAT AAGGGCGACA CGGAAATGTT GAATACTCAT ACTCTTCCTT TTTCAATATT 
5281 ATTGAAGCAT TTATCAGGGT TATTGTCTCA TGAGCGGATA CATATTTGAA TGTATTTAGA 
5341 AAAATAAACA AATAGGGGTT CCGCGCACAT TTCCCCGAAA AGTGCCACCT GAAATTGTAA 
5401 ACGTTAATAT TTTGTTAAAA TTCGCGTTAA ATTTTTGTTA AATCAGCTCA TTTTTTAACC
5461 AATAGGCCGA AATCGGCAAA ATCCCTTATA AATCAAAAGA ATAGACCGAG ATAGGGTTGA 
5521 GTGTTGTTCC AGTTTGGAAC AAGAGTCCAC TATTAAAGAA CGTGGACTCC AACGTCAAAG 
5581 GGCGAAAAAC CGTCTATCAG GGCGATGGCC CACTACGTGA ACCATCACCC TAATCAAGTT 
5641 TTTTGGGGTC GAGGTGCCGT AAAGCACTAA ATCGGAACCC TAAAGGGAGC CCCCGATTTA 
5701 GAGCTTGACG GGGAAAGCCG GCGAACGTGG CGAGAAAGGA AGGGAAGAAA GCGAAAGGAG 
5761 CGGGCGCTAG GGCGCTGGCA AGTGTAGGGG TCACGCTGCG CGTAACCACC ACACCCGCCG 
5821 CGCTTAATGC GCCGCTACAG GGCGCGTCCA TTCGCCATTC AGGCTGCGCA ACTGTTGGGA 
5881 AGGGCGATCG GTGCGGGCCT CTTCGCTATT ACGCCAGCTG GCGAAAGGGG GATGTGCTGC 
5941 AAGGCGATTA AGTTGGG 
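
The feature coordinates in the pDEST6 table can be checked against the numbered listing. The following is a minimal Python sketch, not part of the original disclosure: the function names `parse_numbered_sequence` and `extract_feature` are hypothetical, and the sketch assumes that the leading number on each line is the 1-based position of that line's first base and that a location written high-to-low (such as 266..142 for attR1) covers the same span of bases as the low-to-high form.

```python
import re


def parse_numbered_sequence(text):
    """Map 1-based positions to bases for lines like '121 AGCGGCCGCC GACTAGTGAT ...'."""
    bases = {}
    for line in text.strip().splitlines():
        fields = line.split()
        start = int(fields[0])          # leading number = position of the first base
        residues = "".join(fields[1:])  # drop the spaces between 10-base groups
        for offset, base in enumerate(residues):
            bases[start + offset] = base
    return bases


def extract_feature(bases, location):
    """Return the bases spanned by a 'start..end' location, whichever order it is written in."""
    first, last = (int(n) for n in re.findall(r"\d+", location))
    low, high = min(first, last), max(first, last)
    return "".join(bases[pos] for pos in range(low, high + 1))


# Three lines copied from the pDEST6 listing above (positions 121-300).
excerpt = """
121 AGCGGCCGCC GACTAGTGAT CACAAGTTTG TACAAAAAAG CTGAACGAGA AACGTAAAAT
181 GATATAAATA TCAATATATT AAATTAGATT TTGCATAAAA AACAGACTAC ATAATACTGT
241 AAAACACAAC ATATCCAGTC ACTATGGCGG CCGCTAAGTT GGCAGCATCA CCCGACGCAC
"""

bases = parse_numbered_sequence(excerpt)
attr1 = extract_feature(bases, "266 . . 142")
print(len(attr1), attr1)
```

Under these assumptions the extracted span can be compared directly with the attR1 sequence given in the recombination site table at the start of the document.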
CMV PROMOTER FOR EUKARYOTIC EXPRESSION

cca ttg acg caa atg ggc ggt agg cgt gta cgg tgg gag gtc tat ata agc
ggt aac tgc gtt tac ccg cca tcc gca cat gcc acc ctc cag ata tat tcg

|--> mRNA START
aga gct cgt tta gtg aac cgt cag atc gcc tgg aga cgc cat cca cgc tgt
tct cga gca aat cac ttg gca gtc tag cgg acc tct gcg gta ggt gcg aca

CMV ENHANCER/PROMOTER
1072 ttt gac ctc cat aga aga cac cgg gac cga tcc agc ctc cgg act cta gcc
     aaa ctg gag gta tct tct gtg gcc ctg gct agg tcg gag gcc tga gat cgg

1123 tag gcc gcg gag cgg ata aca att tca cac agg aaa cag cta tga cca cta
     atc cgg cgc ctc gcc tat tgt taa agt gtg tcc ttt gtc gat act ggt gat

Pst
1174 ggc ttt tgc aaa aag cta ttt agg tga cac tat aga agg tac gcc tgc a|gg
     ccg aaa acg ttt ttc gat aaa tcc act gtg ata tct tcc atg cgg|acg tcc

Kpn EcoRI Int attR1
1225 tac cgg tcc gg|a att ccc atc
     atg gcc agg cct taa|ggg tag



Eam1105I

XmnI

pDEST7 6025 bp (rotated to position 2800)

Location (Base Nos.)    Gene Encoded
67..589                 CMV promoter
906..782                attR1
1015..1674              CmR
1794..1878              inactivated ccdA
2016..2321              ccdB
2362..2486              attR2
2671..3033              small t & polyA
3227..3502              f1
3962..4822              ampR
5022..5661              ori



1 ATTATCATGA CATTAACCTA TAAAAATAGG CGTAGTACGA GGCCCTTTCA CTCATTAGAT 
61 GCATGTCGTT ACATAACTTA CGGTAAATGG CCCGCCTGGC TGACCGCCCA ACGACCCCCG 
121 CCCATTGACG TCAATAATGA CGTATGTTCC CATAGTAACG CCAATAGGGA CTTTCCATTG 
181 ACGTCAATGG GTGGAGTATT TACGGTAAAC TGCCCACTTG GCAGTACATC AAGTGTATCA 
241 TATGCCAAGT ACGCCCCCTA TTGACGTCAA TGACGGTAAA TGGCCCGCCT GGCATTATGC 
301 CCAGTACATG ACCTTATGGG ACTTTCCTAC TTGGCAGTAC ATCTACGTAT TAGTCATCGC 
361 TATTACCATG GTGATGCGGT TTTGGCAGTA CATCAATGGG CGTGGATAGC GGTTTGACTC 
421 ACGGGGATTT CCAAGTCTCC ACCCCATTGA CGTCAATGGG AGTTTGTTTT GGCACCAAAA 
481 TCAACGGGAC TTTCCAAAAT GTCGTAACAA CTCCGCCCCA TTGACGCAAA TGGGCGGTAG 
541 GCGTGTACGG TGGGAGGTCT ATATAAGCAG AGCTCGTTTA GTGAACCGTC AGATCGCCTG 
601 GAGACGCCAT CCACGCTGTT TTGACCTCCA TAGAAGACAC CGGGACCGAT CCAGCCTCCG 
661 GACTCTAGCC TAGGCCGCGG AGCGGATAAC AATTTCACAC AGGAAACAGC TATGACCATT 
721 AGGCCTTTGC AAAAAGCTAT TTAGGTGACA CTATAGAAGG TACGCCTGCA GGTACCGGAT 
781 CACAAGTTTG TACAAAAAAG CTGAACGAGA AACGTAAAAT GATATAAATA TCAATATATT 
841 AAATTAGATT TTGCATAAAA AACAGACTAC ATAATACTGT AAAACACAAC ATATCCAGTC 
901 ACTATGGCGG CCGCATTAGG CACCCCAGGC TTTACACTTT ATGCTTCCGG CTCGTATAAT 
961 GTGTGGATTT TGAGTTAGGA TCCGTCGAGA TTTTCAGGAG CTAAGGAAGC TAAAATGGAG 
1021 AAAAAAATCA CTGGATATAC CACCGTTGAT ATATCCCAAT GGCATCGTAA AGAACATTTT 
1081 GAGGCATTTC AGTCAGTTGC TCAATGTACC TATAACCAGA CCGTTCAGCT GGATATTACG
1141 GCCTTTTTAA AGACCGTAAA GAAAAATAAG CACAAGTTTT ATCCGGCCTT TATTCACATT 
1201 CTTGCCCGCC TGATGAATGC TCATCCGGAA TTCCGTATGG CAATGAAAGA CGGTGAGCTG 
1261 GTGATATGGG ATAGTGTTCA CCCTTGTTAC ACCGTTTTCC ATGAGCAAAC TGAAACGTTT 
1321 TCATCGCTCT GGAGTGAATA CCACGACGAT TTCCGGCAGT TTCTACACAT ATATTCGCAA 
1381 GATGTGGCGT GTTACGGTGA AAACCTGGCC TATTTCCCTA AAGGGTTTAT TGAGAATATG
1441 TTTTTCGTCT CAGCCAATCC CTGGGTGAGT TTCACCAGTT TTGATTTAAA CGTGGCCAAT
1501 ATGGACAACT TCTTCGCCCC CGTTTTCACC ATGGGCAAAT ATTATACGCA AGGCGACAAG 
1561 GTGCTGATGC CGCTGGCGAT TCAGGTTCAT CATGCCGTCT GTGATGGCTT CCATGTCGGC 
1621 AGAATGCTTA ATGAATTACA ACAGTACTGC GATGAGTGGC AGGGCGGGGC GTAAACGCGT 
1681 GGATCCGGCT TACTAAAAGC CAGATAACAG TATGCGTATT TGCGCGCTGA TTTTTGCGGT
1741 ATAAGAATAT ATACTGATAT GTATACCCGA AGTATGTCAA AAAGAGGTGT GCTATGAAGC
1801 AGCGTATTAC AGTGACAGTT GACAGCGACA GCTATCAGTT GCTCAAGGCA TATATGATGT 
1861 CAATATCTCC GGTCTGGTAA GCACAACCAT GCAGAATGAA GCCCGTCGTC TGCGTGCCGA 
1921 ACGCTGGAAA GCGGAAAATC AGGAAGGGAT GGCTGAGGTC GCCCGGTTTA TTGAAATGAA 
1981 CGGCTCTTTT GCTGACGAGA ACAGGGACTG GTGAAATGCA GTTTAAGGTT TACACCTATA 
2041 AAAGAGAGAG CCGTTATCGT CTGTTTGTGG ATGTACAGAG TGATATTATT GACACGCCCG 
2101 GGCGACGGAT GGTGATCCCC CTGGCCAGTG CACGTCTGCT GTCAGATAAA GTCTCCCGTG 
2161 AACTTTACCC GGTGGTGCAT ATCGGGGATG AAAGCTGGCG CATGATGACC ACCGATATGG 
2221 CCAGTGTGCC GGTCTCCGTT ATCGGGGAAG AAGTGGCTGA TCTCAGCCAC CGCGAAAATG 
2281 ACATCAAAAA CGCCATTAAC CTGATGTTCT GGGGAATATA AATGTCAGGC TCCCTTATAC 
2341 ACAGCCAGTC TGCAGGTCGA CCATAGTGAC TGGATATGTT GTGTTTTACA GTATTATGTA 
2401 GTCTGTTTTT TATGCAAAAT CTAATTTAAT ATATTGATAT TTATATCATT TTACGTTTCT 
2461 CGTTCAGCTT TCTTGTACAA AGTGGTGATC GCGTGCATGC GACGTCATAG CTCTCTCCCT 
2521 ATAGTGAGTC GTATTATAAG CTAGGCACTG GCCGTCGTTT TACAACGTCG TGACTGGGAA 
2581 AACTGCTAGC TTGGGATCTT TGTGAAGGAA CCTTACTTCT GTGGTGTGAC ATAATTGGAC 
2641 AAACTACCTA CAGAGATTTA AAGCTCTAAG GTAAATATAA AATTTTTAAG TGTATAATGT 
2701 GTTAAACTAG CTGCATATGC TTGCTGCTTG AGAGTTTTGC TTACTGAGTA TGATTTATGA 
2761 AAATATTATA CACAGGAGCT AGTGATTCTA ATTGTTTGTG TATTTTAGAT TCACAGTCCC 
2821 AAGGCTCATT TCAGGCCCCT CAGTCCTCAC AGTCTGTTCA TGATCATAAT CAGCCATACC 
2881 ACATTTGTAG AGGTTTTACT TGCTTTAAAA AACCTCCCAC ACCTCCCCCT GAACCTGAAA 
2941 CATAAAATGA ATGCAATTGT TGTTGTTAAC TTGTTTATTG CAGCTTATAA TGGTTACAAA 
3001 TAAAGCAATA GCATCACAAA TTTCACAAAT AAAGCATTTT TTTCACTGCA TTCTAGTTGT 
3061 GGTTTGTCCA AACTCATCAA TGTATCTTAT CATGTCTGGA TCGATCCTGC ATTAATGAAT 
3121 CGGCCAACGC GCGGGGAGAG GCGGTTTGCG TATTGGCTGG CGTAATAGCG AAGAGGCCCG 
3181 CACCGATCGC CCTTCCCAAC AGTTGCGCAG CCTGAATGGC GAATGGGACG CGCCCTGTAG 
3241 CGGCGCATTA AGCGCGGCGG GTGTGGTGGT TACGCGCAGC GTGACCGCTA CACTTGCCAG 
3301 CGCCCTAGCG CCCGCTCCTT TCGCTTTCTT CCCTTCCTTT CTCGCCACGT TCGCCGGCTT 
3361 TCCCCGTCAA GCTCTAAATC GGGGGCTCCC TTTAGGGTTC CGATTTAGTG CTTTACGGCA 
3421 CCTCGACCCC AAAAAACTTG ATTAGGGTGA TGGTTCACGT AGTGGGCCAT CGCCCTGATA 
3481 GACGGTTTTT CGCCCTTTGA CGTTGGAGTC CACGTTCTTT AATAGTGGAC TCTTGTTCCA
3541 AACTGGAACA ACACTCAACC CTATCTCGGT CTATTCTTTT GATTTATAAG GGATTTTGCC 
3601 GATTTCGGCC TATTGGTTAA AAAATGAGCT GATTTAACAA AAATTTAACG CGAATTTTAA 
3661 CAAAATATTA ACGTTTACAA TTTCAGGTGG CACTTTTCGG GGAAATGTGC GCGGAACCCC 
3721 TATTTGTTTA TTTTTCTAAA TACATTCAAA TATGTATCCG CTCATGCCAG GTCTTGGACT 
3781 GGTGAGAACG GCTTGCTCGG CAGCTTCGAT GTGTGCTGGA GGGAGAATAA AGGTCTAAGA 
3841 TGTGCGATAG AGGGAAGTCG CATTGAATTA TGTGCTGTGT AGGGATCGCT GGTATCAAAT 
3901 ATGTGTGCCC ACCCCTGGCA TGAGACAATA ACCCTGATAA ATGCTTCAAT AATATTGAAA 
3961 AAGGAAGAGT ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 
4021 TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG CTGAAGATCA 
4081 GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC AGCGGTAAGA TCCTTGAGAG 
4141 TTTTCGCCCC GAAGAACGTT TTCCAATGAT GAGCACTTTT AAAGTTCTGC TATGTGGCGC
4201 GGTATTATCC CGTATTGACG CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA
4261 GAATGACTTG GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 
4321 AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA ACTTACTTCT 
4381 GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG CACAACATGG GGGATCATGT
4441 AACTCGCCTT GATCGTTGGG AACCGGAGCT GAATGAAGCC ATACCAAACG ACGAGCGTGA 
4501 CACCACGATG CCTGTAGCAA TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT 
4561 TACTCTAGCT TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 
4621 ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG GAGCCGGTGA 
4681 GCGTGGGTCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT GGTAAGCCCT CCCGTATCGT 
4741 AGTTATCTAC ACGACGGGGA GTCAGGCAAC TATGGATGAA CGAAATAGAC AGATCGCTGA 
4801 GATAGGTGCC TCACTGATTA AGCATTGGTA ACTGTCAGAC CAAGTTTACT CATATATACT 
4861 TTAGATTGAT TTAAAACTTC ATTTTTAATT TAAAAGGATC TAGGTGAAGA TCCTTTTTGA 
4921 TAATCTCATG CCATAACTTC GTATAATGTA TGCTATACGA AGTTATGGCA TGACCAAAAT 
4981 CCCTTAACGT GAGTTTTCGT TCCACTGAGC GTCAGACCCC GTAGAAAAGA TCAAAGGATC 
5041 TTCTTGAGAT CCTTTTTTTC TGCGCGTAAT CTGCTGCTTG CAAACAAAAA AACCACCGCT
5101 ACCAGCGGTG GTTTGTTTGC CGGATCAAGA GCTACCAACT CTTTTTCCGA AGGTAACTGG 
5161 CTTCAGCAGA GCGCAGATAC CAAATACTGT CCTTCTAGTG TAGCCGTAGT TAGGCCACCA 
5221 CTTCAAGAAC TCTGTAGCAC CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC 
5281 TGCTGCCAGT GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT AGTTACCGGA 
5341 TAAGGCGCAG CGGTCGGGCT GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC 
5401 GACCTACACC GAACTGAGAT ACCTACAGCG TGAGCATTGA GAAAGCGCCA CGCTTCCCGA 
5461 AGGGAGAAAG GCGGACAGGT ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG 
5521 GGAGCTTCCA GGGGGAAACG CCTGGTATCT TTATAGTCCT GTCGGGTTTC GCCACCTCTG 
5581 ACTTGAGCGT CGATTTTTGT GATGCTCGTC AGGGGGGCGG AGCCTATGGA AAAACGCCAG 
5641 CAACGCGGCC TTTTTACGGT TCCTGGCCTT TTGCTGGCCT TTTGCTCACA TGTTCTTTCC
5701 TGCGTTATCC CCTGATTCTG TGGATAACCG TATTACCGCC TTTGAGTGAG CTGATACCGC 
5761 TCGCCGCAGC CGAACGACCG AGCGCAGCGA GTCAGTGAGC GAGGAAGCGG AAGAGCGCCC 
5821 AATACGCAAA CCGCCTCTCC CCGCGCGTTG GCCGATTCAT TAATGCAGAG CTTGCAATTC 
5881 GCGCGTTTTT CAATATTATT GAAGCATTTA TCAGGGTTAT TGTCTCATGA GCGGATACAT 
5941 ATTTGAATGT ATTTAGAAAA ATAAACAAAT AGGGGTTCCG CGCACATTTC CCCGAAAAGT 
6001 GCCACCTGAC GTCTAAGAAA CCATT 
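
Scanned listings of this kind are prone to character-recognition damage (for example, runs of T read as "I" or "1"). The following is a minimal Python sketch of a consistency check, not part of the original disclosure: the function names `find_damaged_lines` and `last_position` are hypothetical, and the sketch assumes that every line except the last carries exactly 60 bases drawn from A, C, G and T, with the leading number giving the 1-based position of the first base.

```python
def find_damaged_lines(lines, width=60):
    """Return the start positions of lines that fail the width or alphabet check."""
    damaged = []
    for index, line in enumerate(lines):
        fields = line.split()
        start = int(fields[0])
        residues = "".join(fields[1:])           # spaces between groups are ignored
        is_last = index == len(lines) - 1
        bad_alphabet = bool(set(residues) - set("ACGT"))
        bad_width = len(residues) != width and not is_last
        if bad_alphabet or bad_width:
            damaged.append(start)
    return damaged


def last_position(line):
    """Return the 1-based position of the final base on a numbered line."""
    fields = line.split()
    return int(fields[0]) + len("".join(fields[1:])) - 1


# Sample lines in the style of the pDEST7 listing: one intact line, one with
# "I" characters standing in for bases, and the short final line.
sample = [
    "1381 GATGT GGCGT GTTACGGTGA AAACCTGGCC TATTTCCCTA AAGGGTTTAT TGAGAATATG",
    "1441 I I I I ICGTCT CAGCCAATCC CTGGGTGAGT TTCACCAGTT TTGATTTAAA CGTGGCCAAT",
    "6001 GCCACCTGAC GTCTAAGAAA CCATT",
]

print(find_damaged_lines(sample))
print(last_position(sample[-1]))
```

The final position computed from the last line can be compared with the plasmid size stated in the heading (6025 bp for pDEST7) as a quick check that no full lines were dropped.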
pDEST8 Polyhedrin Promoter, Baculovirus Transfer Plasmid

AccI
  1 cgt|ata ctc cgg aat att aat aga tca tgg aga taa tta aaa tga taa cca
    gca ta|t gag gcc tta taa tta tct agt acc tct att aat ttt act att ggt

                        |--> mRNA (polyhedrin)
 52 tct cgc aaa taa ata agt att tta ctg ttt tcg taa cag ttt tgt aat aaa
    aga gcg ttt att tat tca taa aat gac aaa agc att gtc aaa aca tta ttt

103 aaa acc tat aaa tat tcc gga tta ttc ata ccg tcc cac cat cgg gcg cgg
    ttt tgg ata ttt ata agg cct aat aag tat ggc agg gtg gta gcc cgc gcc

(Bam)   Int attR1
154 atc atc aca agt tt|g tac aaa aaa gct gaa cga gaa acg taa aat gat ata
    tag tag tgt tca aac atg ttt|ttt cga ctt gct ctt tgc att tta cta tat

SnaB I 6525

pDEST8 6526 bp

Location (Base Nos.)    Gene Encoded
23..152                 Ppolh
284..160                attR1
534..1193               CmR
1313..1397              inactivated ccdA
1535..1840              ccdB
1881..2005              attR2
2766..3146              f1
3240..4090              ampR
4289..4869              ori
5564..6496              genR



1 CGTATACTCC GGAATATTAA TAGATCATGG AGATAATTAA AATGATAACC ATCTCGCAAA 
61 TAAATAAGTA TTTTACTGTT TTCGTAACAG TTTTGTAATA AAAAAACCTA TAAATATTCC 
121 GGATTATTCA TACCGTCCCA CCATCGGGCG CGGATCATCA CAAGTTTGTA CAAAAAAGCT 
181 GAACGAGAAA CGTAAAATGA TATAAATATC AATATATTAA ATTAGATTTT GCATAAAAAA 
241 CAGACTACAT AATACTGTAA AACACAACAT ATCCAGTCAC TATGGCGGCC GCTAAGTTGG 
301 CAGCATCACC CGACGCACTT TGCGCCGAAT AAATACCTGT GACGGAAGAT CACTTCGCAG 
361 AATAAATAAA TCCTGGTGTC CCTGTTGATA CCGGGAAGCC CTGGGCCAAC TTTTGGCGAA 
421 AATGAGACGT TGATCGGCAC GTAAGAGGTT CCAACTTTCA CCATAATGAA ATAAGATCAC
481 TACCGGGCGT ATTTTTTGAG TTATCGAGAT TTTCAGGAGC TAAGGAAGCT AAAATGGAGA
541 AAAAAATCAC TGGATATACC ACCGTTGATA TATCCCAATG GCATCGTAAA GAACATTTTG 
601 AGGCATTTCA GTCAGTTGCT CAATGTACCT ATAACCAGAC CGTTCAGCTG GATATTACGG
661 CCTTTTTAAA GACCGTAAAG AAAAATAAGC ACAAGTTTTA TCCGGCCTTT ATTCACATTC
721 TTGCCCGCCT GATGAATGCT CATCCGGAAT TCCGTATGGC AATGAAAGAC GGTGAGCTGG 
781 TGATATGGGA TAGTGTTCAC CCTTGTTACA CCGTTTTCCA TGAGCAAACT GAAACGTTTT 
841 CATCGCTCTG GAGTGAATAC CACGACGATT TCCGGCAGTT TCTACACATA TATTCGCAAG 
901 ATGTGGCGTG TTACGGTGAA AACCTGGCCT ATTTCCCTAA AGGGTTTATT GAGAATATGT
961 TTTTCGTCTC AGCCAATCCC TGGGTGAGTT TCACCAGTTT TGATTTAAAC GTGGCCAATA 
1021 TGGACAACTT CTTCGCCCCC GTTTTCACCA TGGGCAAATA TTATACGCAA GGCGACAAGG 
1081 TGCTGATGCC GCTGGCGATT CAGGTTCATC ATGCCGTCTG TGATGGCTTC CATGTCGGCA 
1141 GAATGCTTAA TGAATTACAA CAGTACTGCG ATGAGTGGCA GGGCGGGGCG TAAACGCGTG
1201 GATCCGGCTT ACTAAAAGCC AGATAACAGT ATGCGTATTT GCGCGCTGAT TTTTGCGGTA 
1261 TAAGAATATA TACTGATATG TATACCCGAA GTATGTCAAA AAGAGGTGTG CTATGAAGCA 
1321 GCGTATTACA GTGACAGTTG ACAGCGACAG CTATCAGTTG CTCAAGGCAT ATATGATGTC 
1381 AATATCTCCG GTCTGGTAAG CACAACCATG CAGAATGAAG CCCGTCGTCT GCGTGCCGAA 
1441 CGCTGGAAAG CGGAAAATCA GGAAGGGATG GCTGAGGTCG CCCGGTTTAT TGAAATGAAC
1501 GGCTCTTTTG CTGACGAGAA CAGGGACTGG TGAAATGCAG TTTAAGGTTT ACACCTATAA 
1561 AAGAGAGAGC CGTTATCGTC TGTTTGTGGA TGTACAGAGT GATATTATTG ACACGCCCGG 
1621 GCGACGGATG GTGATCCCCC TGGCCAGTGC ACGTCTGCTG TCAGATAAAG TCTCCCGTGA 
1681 ACTTTACCCG GTGGTGCATA TCGGGGATGA AAGCTGGCGC ATGATGACCA CCGATATGGC
1741 CAGTGTGCCG GTCTCCGTTA TCGGGGAAGA AGTGGCTGAT CTCAGCCACC GCGAAAATGA
1801 CATCAAAAAC GCCATTAACC TGATGTTCTG GGGAATATAA ATGTCAGGCT CCCTTATACA 
1861 CAGCCAGTCT GCAGGTCGAC CATAGTGACT GGATATGTTG TGTTTTACAG TATTATGTAG
1921 TCTGTTTTTT ATGCAAAATC TAATTTAATA TATTGATATT TATATCATTT TACGTTTCTC
1981 GTTCAGCTTT CTTGTACAAA GTGGTGATAG CTTGTCGAGA AGTACTAGAG GATCATAATC 
2041 AGCCATACCA CATTTGTAGA GGTTTTACTT GCTTTAAAAA ACCTCCCACA CCTCCCCCTG 
2101 AACCTGAAAC ATAAAATGAA TGCAATTGTT GTTGTTAACT TGTTTATTGC AGCTTATAAT 
2161 GGTTACAAAT AAAGCAATAG CATCACAAAT TTCACAAATA AAGCATTTTT TTCACTGCAT 
2221 TCTAGTTGTG GTTTGTCCAA ACTCATCAAT GTATCTTATC ATGTCTGGAT CTGATCACTG 
2281 CTTGAGCCTA GGAGATCCGA ACCAGATAAG TGAAATCTAG TTCCAAACTA TTTTGTCATT 
2341 TTTAATTTTC GTATTAGCTT ACGACGCTAC ACCCAGTTCC CATCTATTTT GTCACTCTTC 
2401 CCTAAATAAT CCTTAAAAAC TCCATTTCCA CCCCTCCCAG TTCCCAACTA TTTTGTCCGC
2461 CCACAGCGGG GCATTTTTCT TCCTGTTATG TTTTTAATCA AACATCCTGC CAACTCCATG 
2521 TGACAAACCG TCATCTTCGG CTACTTTTTC TCTGTCACAG AATGAAAATT TTTCTGTCAT 
2581 CTCTTCGTTA TTAATGTTTG TAATTGACTG AATATCAACG CTTATTTGCA GCCTGAATGG 
2641 CGAATGGACG CGCCCTGTAG CGGCGCATTA AGCGCGGCGG GTGTGGTGGT TACGCGCAGC 
2701 GTGACCGCTA CACTTGCCAG CGCCCTAGCG CCCGCTCCTT TCGCTTTCTT CCCTTCCTTT 
2761 CTCGCCACGT TCGCCGGCTT TCCCCGTCAA GCTCTAAATC GGGGGCTCCC TTTAGGGTTC 
2821 CGATTTAGTG CTTTACGGCA CCTCGACCCC AAAAAACTTG ATTAGGGTGA TGGTTCACGT 
2881 AGTGGGCCAT CGCCCTGATA GACGGTTTTT CGCCCTTTGA CGTTGGAGTC CACGTTCTTT 
2941 AATAGTGGAC TCTTGTTCCA AACTGGAACA ACACTCAACC CTATCTCGGT CTATTCTTTT 
3001 GATTTATAAG GGATTTTGCC GATTTCGGCC TATTGGTTAA AAAATGAGCT GATTTAACAA 
3061 AAATTTAACG CGAATTTTAA CAAAATATTA ACGTTTACAA TTTCAGGTGG CACTTTTCGG 
3121 GGAAATGTGC GCGGAACCCC TATTTGTTTA TTTTTCTAAA TACATTCAAA TATGTATCCG 
3181 CTCATGAGAC AATAACCCTG ATAAATGCTT CAATAATATT GAAAAAGGAA GAGTATGAGT 
3241 ATTCAACATT TCCGTGTCGC CCTTATTCCC TTTTTTGCGG CATTTTGCCT TCCTGTTTTT
3301 GCTCACCCAG AAACGCTGGT GAAAGTAAAA GATGCTGAAG ATCAGTTGGG TGCACGAGTG
3361 GGTTACATCG AACTGGATCT CAACAGCGGT AAGATCCTTG AGAGTTTTCG CCCCGAAGAA 
3421 CGTTTTCCAA TGATGAGCAC TTTTAAAGTT CTGCTATGTG GCGCGGTATT ATCCCGTATT 
3481 GACGCCGGGC AAGAGCAACT CGGTCGCCGC ATACACTATT CTCAGAATGA CTTGGTTGAG 
3541 TACTCACCAG TCACAGAAAA GCATCTTACG GATGGCATGA CAGTAAGAGA ATTATGCAGT 
3601 GCTGCCATAA CCATGAGTGA TAACACTGCG GCCAACTTAC TTCTGACAAC GATCGGAGGA 
3661 CCGAAGGAGC TAACCGCTTT TTTGCACAAC ATGGGGGATC ATGTAACTCG CCTTGATCGT 
3721 TGGGAACCGG AGCTGAATGA AGCCATACCA AACGACGAGC GTGACACCAC GATGCCTGTA 
3781 GCAATGGCAA CAACGTTGCG CAAACTATTA ACTGGCGAAC TACTTACTCT AGCTTCCCGG 
3841 CAACAATTAA TAGACTGGAT GGAGGCGGAT AAAGTTGCAG GACCACTTCT GCGCTCGGCC 
3901 CTTCCGGCTG GCTGGTTTAT TGCTGATAAA TCTGGAGCCG GTGAGCGTGG GTCTCGCGGT 
3961 ATCATTGCAG CACTGGGGCC AGATGGTAAG CCCTCCCGTA TCGTAGTTAT CTACACGACG 
4021 GGGAGTCAGG CAACTATGGA TGAACGAAAT AGACAGATCG CTGAGATAGG TGCCTCACTG 
4081 ATTAAGCATT GGTAACTGTC AGACCAAGTT TACTCATATA TACTTTAGAT TGATTTAAAA
4141 CTTCATTTTT AATTTAAAAG GATCTAGGTG AAGATCCTTT TTGATAATCT CATGACCAAA
4201 ATCCCTTAAC GTGAGTTTTC GTTCCACTGA GCGTCAGACC CCGTAGAAAA GATCAAAGGA
4261 TCTTCTTGAG ATCCTTTTTT TCTGCGCGTA ATCTGCTGCT TGCAAACAAA AAAACCACCG
4321 CTACCAGCGG TGGTTTGTTT GCCGGATCAA GAGCTACCAA CTCTTTTTCC GAAGGTAACT 
4381 GGCTTCAGCA GAGCGCAGAT ACCAAATACT GTCCTTCTAG TGTAGCCGTA GTTAGGCCAC 
4441 CACTTCAAGA ACTCTGTAGC ACCGCCTACA TACCTCGCTC TGCTAATCCT GTTACCAGTG 
4501 GCTGCTGCCA GTGGCGATAA GTCGTGTCTT ACCGGGTTGG ACTCAAGACG ATAGTTACCG 
4561 GATAAGGCGC AGCGGTCGGG CTGAACGGGG GGTTCGTGCA CACAGCCCAG CTTGGAGCGA 
4621 ACGACCTACA CCGAACTGAG ATACCTACAG CGTGAGCATT GAGAAAGCGC CACGCTTCCC 
4681 GAAGGGAGAA AGGCGGACAG GTATCCGGTA AGCGGCAGGG TCGGAACAGG AGAGCGCACG 
4741 AGGGAGCTTC CAGGGGGAAA CGCCTGGTAT CTTTATAGTC CTGTCGGGTT TCGCCACCTC 
4801 TGACTTGAGC GTCGATTTTT GTGATGCTCG TCAGGGGGGC GGAGCCTATG GAAAAACGCC 
4861 AGCAACGCGG CCTTTTTACG GTTCCTGGCC TTTTGCTGGC CTTTTGCTCA CATGTTCTTT 
4921 CCTGCGTTAT CCCCTGATTC TGTGGATAAC CGTATTACCG CCTTTGAGTG AGCTGATACC 
4981 GCTCGCCGCA GCCGAACGAC CGAGCGCAGC GAGTCAGTGA GCGAGGAAGC GGAAGAGCGC 
5041 CTGATGCGGT ATTTTCTCCT TACGCATCTG TGCGGTATTT CACACCGCAG ACCAGCCGCG 
5101 TAACCTGGCA AAATCGGTTA CGGTTGAGTA ATAAATGGAT GCCCTGCGTA AGCGGGTGTG 
5161 GGCGGACAAT AAAGTCTTAA ACTGAACAAA ATAGATCTAA ACTATGACAA TAAAGTCTTA 
5221 AACTAGACAG AATAGTTGTA AACTGAAATC AGTCCAGTTA TGCTGTGAAA AAGCATACTG 
5281 GACTTTTGTT ATGGCTAAAG CAAACTCTTC ATTTTCTGAA GTGCAAATTG CCCGTCGTAT 
5341 TAAAGAGGGG CGTGGCCAAG GGCATGGTAA AGACTATATT CGCGGCGTTG TGACAATTTA
5401 CCGAACAACT CCGCGGCCGG GAAGCCGATC TCGGCTTGAA CGAATTGTTA GGTGGCGGTA 
5461 CTTGGGTCGA TATCAAAGTG CATCACTTCT TCCCGTATGC CCAACTTTGT ATAGAGAGCC 
5521 ACTGCGGGAT CGTCACCGTA ATCTGCTTGC ACGTAGATCA CATAAGCACC AAGCGCGTTG 
5581 GCCTCATGCT TGAGGAGATT GATGAGCGCG GTGGCAATGC CCTGCCTCCG GTGCTCGCCG 
5641 GAGACTGCGA GATCATAGAT ATAGATCTCA CTACGCGGCT GCTCAAACCT GGGCAGAACG 
5701 TAAGCCGCGA GAGCGCCAAC AACCGCTTCT TGGTCGAAGG CAGCAAGCGC GATGAATGTC 
5761 TTACTACGGA GCAAGTTCCC GAGGTAATCG GAGTCCGGCT GATGTTGGGA GTAGGTGGCT 
5821 ACGTCTCCGA ACTCACGACC GAAAAGATCA AGAGCAGCCC GCATGGATTT GACTTGGTCA 
5881 GGGCCGAGCC TACATGTGCG AATGATGCCC ATACTTGAGC CACCTAACTT TGTTTTAGGG 
5941 CGACTGCCCT GCTGCGTAAC ATCGTTGCTG CTGCGTAACA TCGTTGCTGC TCCATAACAT 
6001 CAAACATCGA CCCACGGCGT AACGCGCTTG CTGCTTGGAT GCCCGAGGCA TAGACTGTAC 
6061 AAAAAAACAG TCATAACAAG CCATGAAAAC CGCCACTGCG CCGTTACCAC CGCTGCGTTC 
6121 GGTCAAGGTT CTGGACCAGT TGCGTGAGCG CATACGCTAC TTGCATTACA GTTTACGAAC 
6181 CGAACAGGCT TATGTCAACT GGGTTCGTGC CTTCATCCGT TTCCACGGTG TGCGTCACCC 
6241 GGCAACCTTG GGCAGCAGCG AAGTCGAGGC ATTTCTGTCC TGGCTGGCGA ACGAGCGCAA 
6301 GGTTTCGGTC TCCACGCATC GTCAGGCATT GGCGGCCTTG CTGTTCTTCT ACGGCAAGGT 
6361 GCTGTGCACG GATCTGCCCT GGCTTCAGGA GATCGGAAGA CCTCGGCCGT CGCGGCGCTT 
6421 GCCGGTGGTG CTGACCCCGG ATGAAGTGGT TCGCATCCTC GGTTTTCTGG AAGGCGAGCA 
6481 TCGTTTGTTC GCCCAGGACT CTAGCTATAG TTCTAGTGGT TGGCTA

103 ttg gcg agg gac att aag gcg ttt aag aaa ttg aga gga cct gtt ata
    aac cgc tcc ctg taa ttc cgc aaa ttc ttt aac tct cct gga caa tat

26S RNA

pDEST9 12464 bp

Location (Base Nos.)    Gene Encoded
355..232                attR1
605..1264               CmR
1384..1468              inactivated ccdA
1606..1911              ccdB
1952..2078              attR2
2532..2782              ori
3482..4282              ampR
5232..5365              SP6 promoter
5365..6965              nsP1: non-structural protein 1
6965..9265              nsP2: non-structural protein 2
9265..10865             nsP3: non-structural protein 3
10865..161              nsP4: non-structural protein 4




1 AGCAAGTGGT TCCGGACAGG CTTGGGGGCC GAACTGGAGG TGGCACTAAC ATCTAGGTAT 
61 GAGGTAGAGG GCTGCAAAAG TATCCTCATA GCCATGGCCA CCTTGGCGAG GGACATTAAG 
121 GCGTTTAAGA AATTGAGAGG ACCTGTTATA CACCTCTACG GCGGTCCTAG ATTGGTGCGT 
181 TAATACACAG AATTCTGATT GGATCCCGGT CCGAAGCGCG CTTTCCCATC ACAAGTTTGT 
241 ACAAAAAAGC TGAACGAGAA ACGTAAAATG ATATAAATAT CAATATATTA AATTAGATTT 
301 TGCATAAAAA ACAGACTACA TAATACTGTA AAACACAACA TATCCAGTCA CTATGGCGGC 
361 CGCTAAGTTG GCAGCATCAC CCGACGCACT TTGCGCCGAA TAAATACCTG TGACGGAAGA
421 TCACTTCGCA GAATAAATAA ATCCTGGTGT CCCTGTTGAT ACCGGGAAGC CCTGGGCCAA
481 CTTTTGGCGA AAATGAGACG TTGATCGGCA CGTAAGAGGT TCCAACTTTC ACCATAATGA
541 AATAAGATCA CTACCGGGCG TATTTTTTGA GTTATCGAGA TTTTCAGGAG CTAAGGAAGC
601 TAAAATGGAG AAAAAAATCA CTGGATATAC CACCGTTGAT ATATCCCAAT GGCATCGTAA 
661 AGAACATTTT GAGGCATTTC AGTCAGTTGC TCAATGTACC TATAACCAGA CCGTTCAGCT
721 GGATATTACG GCCTTTTTAA AGACCGTAAA GAAAAATAAG CACAAGTTTT ATCCGGCCTT 
781 TATTCACATT CTTGCCCGCC TGATGAATGC TCATCCGGAA TTCCGTATGG CAATGAAAGA 
841 CGGTGAGCTG GTGATATGGG ATAGTGTTCA CCCTTGTTAC ACCGTTTTCC ATGAGCAAAC 
901 TGAAACGTTT TCATCGCTCT GGAGTGAATA CCACGACGAT TTCCGGCAGT TTCTACACAT 
961 ATATTCGCAA GATGTGGCGT GTTACGGTGA AAACCTGGCC TATTTCCCTA AAGGGTTTAT
1021 TGAGAATATG TTTTTCGTCT CAGCCAATCC CTGGGTGAGT TTCACCAGTT TTGATTTAAA 
1081 CGTGGCCAAT ATGGACAACT TCTTCGCCCC CGTTTTCACC ATGGGCAAAT ATTATACGCA 
1141 AGGCGACAAG GTGCTGATGC CGCTGGCGAT TCAGGTTCAT CATGCCGTCT GTGATGGCTT 
1201 CCATGTCGGC AGAATGCTTA ATGAATTACA ACAGTACTGC GATGAGTGGC AGGGCGGGGC 
1261 GTAAAGATCT GGATCCGGCT TACTAAAAGC CAGATAACAG TATGCGTATT TGCGCGCTGA
1321 TTTTTGCGGT ATAAGAATAT ATACTGATAT GTATACCCGA AGTATGTCAA AAAGAGGTGT
1381 GCTATGAAGC AGCGTATTAC AGTGACAGTT GACAGCGACA GCTATCAGTT GCTCAAGGCA 
1441 TATATGATGT CAATATCTCC GGTCTGGTAA GCACAACCAT GCAGAATGAA GCCCGTCGTC 
1501 TGCGTGCCGA ACGCTGGAAA GCGGAAAATC AGGAAGGGAT GGCTGAGGTC GCCCGGTTTA 
1561 TTGAAATGAA CGGCTCTTTT GCTGACGAGA ACAGGGACTG GTGAAATGCA GTTTAAGGTT
1621 TACACCTATA AAAGAGAGAG CCGTTATCGT CTGTTTGTGG ATGTACAGAG TGATATTATT
1681 GACACGCCCG GGCGACGGAT GGTGATCCCC CTGGCCAGTG CACGTCTGCT GTCAGATAAA 
1741 GTCTCCCGTG AACTTTACCC GGTGGTGCAT ATCGGGGATG AAAGCTGGCG CATGATGACC 
1801 ACCGATATGG CCAGTGTGCC GGTCTCCGTT ATCGGGGAAG AAGTGGCTGA TCTCAGCCAC 
1861 CGCGAAAATG ACATCAAAAA CGCCATTAAC CTGATGTTCT GGGGAATATA AATGTCAGGC 
1921 TCCGTTATAC ACAGCCAGTC TGCAGGTCGA CCATAGTGAC TGGATATGTT GTGTTTTACA 
1981 GTATTATGTA GTCTGTTTTT TATGCAAAAG TGCTAATTTA ATATATTGAT ATTTATATCA 
2041 TTTTACGTTT CTCGTTCAGC TTTCTTGTAC AAAGTGGTGA TGGGAACTCG AGTTCACTAG 
2101 TCGATCCCGC GGCCGCTTTC GAACCTAGGC AAGCATGCGG GCCCAGTGGG TAATTAATTG 
2161 AATTACATCC CTACGCAAAC GTTTTACGGC CGCCGGTGGC GCCCGCGCCC GGCGGCCCGT 
2221 CCTTGGCCGT TGCAGGCCAC TCCGGTGGCT CCCGTCGTCC CCGACTTCCA GGCCCAGCAG 
2281 ATGCAGCAAC TCATCAGCGC CGTAAATGCG CTGACAATGA GACAGAACGC AATTGCTCCT 
2341 GCTAGGAGCT TAATTCGACG AATAATTGGA TTTTTATTTT ATTTTGCAAT TGGTTTTTAA 
2401 TATTTCCAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
2461 AAAAAAAAAA AAAAAAACTA GAAATCGCGA TTTCTAGTCT GCATTAATGA ATCGGCCAAC 
2521 GCGCGGGGAG AGGCGGTTTG CGTATTGGGC GCTCTTCCGC TTCCTCGCTC ACTGACTCGC 
2581 TGCGCTCGGT CGTTCGGCTG CGGCGAGCGG TATCAGCTCA CTCAAAGGCG GTAATACGGT 
2641 TATCCACAGA ATCAGGGGAT AACGCAGGAA AGAACATGTG AGCAAAAGGC CAGCAAAAGG 
2701 CCAGGAACCG TAAAAAGGCC GCGTTGCTGG CGTTTTTCCA TAGGCTCCGC CCCCCTGACG 
2761 AGCATCACAA AAATCGACGC TCAAGTCAGA GGTGGCGAAA CCCGACAGGA CTATAAAGAT 
2821 ACCAGGCGTT TCCCCCTGGA AGCTCCCTCG TGCGCTCTCC TGTTCCGACC CTGCCGCTTA 
2881 CCGGATACCT GTCCGCCTTT CTCCCTTCGG GAAGCGTGGC GCTTTCTCAA TGCTCGCGCT 
2941 GTAGGTATCT CAGTTCGGTG TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG CACGAACCCC
3001 CCGTTCAGCC CGACCGCTGC GCCTTATCCG GTAACTATCG TCTTGAGTCC AACCCGGTAA 
3061 GACACGACTT ATCGCCACTG GCAGCAGCCA CTGGTAACAG GATTAGCAGA GCGAGGTATG 
3121 TAGGCGGTGC TACAGAGTTC TTGAAGTGGT GGCCTAACTA CGGCTACACT AGAAGGACAG 
3181 TATTTGGTAT CTGCGCTCTG CTGAAGCCAG TTACCTTCGG AAAAAGAGTT GGTAGCTCTT 
3241 GATCCGGCAA ACAAACCACC GCTGGTAGCG GTGGTTTTTT TGTTTGCAAG CAGCAGATTA
3301 CGCGCAGAAA AAAAGGATCT CAAGAAGATC CTTTGATCTT TTCTACGGGG TCTGACGCTC 
3361 AGTGGAACGA AAACTCACGT TAAGGGATTT TGGTCATGAG ATTATCAAAA AGGATCTTCA
3421 CCTAGATCCT TTTAAATTAA AAATGAAGTT TTAAATCAAT CTAAAGTATA TATGAGTAAA 
3481 CTTGGTCTGA CAGTTACCAA TGCTTAATCA GTGAGGCACC TATCTCAGCG ATCTGTCTAT 
3541 TTCGTTCATC CATAGTTGCC TGACTCCCCG TCGTGTAGAT AACTACGATA CGGGAGGGCT 
3601 TACCATCTGG CCCCAGTGCT GCAATGATAC CGCGAGACCC ACGCTCACCG GCTCCAGATT 
3661 TATCAGCAAT AAACCAGCCA GCCGGAAGGG CCGAGCGCAG AAGTGGTCCT GCAACTTTAT 
3721 CCGCCTCCAT CCAGTCTATT AATTGTTGCC GGGAAGCTAG AGTAAGTAGT TCGCCAGTTA 
3781 ATAGTTTGCG CAACGTTGTT GCCATTGCTA CAGGCATCGT GGTGTCACGC TCGTCGTTTG 
3841 GTATGGCTTC ATTCAGCTCC GGTTCCCAAC GATCAAGGCG AGTTACATGA TCCCCCATGT 
3901 TGTGCAAAAA AGCGGTTAGC TCCTTCGGTC CTCCGATCGT TGTCAGAAGT AAGTTGGCCG 
3961 CAGTGTTATC ACTCATGGTT ATGGCAGCAC TGCATAATTC TCTTACTGTC ATGCCATCCG 
4021 TAAGATGCTT TTCTGTGACT GGTGAGTACT CAACCAAGTC ATTCTGAGAA TAGTGTATGC
4081 GGCGACCGAG TTGCTCTTGC CCGGCGTCAA TACGGGATAA TACCGCGCCA CATAGGAGAA
4141 CTTTAAAAGT GCTCATCATT GGAAAACGTT CTTCGGGGCG AAAACTCTCA AGGATCTTAC 
4201 CGCTGTTGAG ATCCAGTTCG ATGTAACCCA CTCGTGCACC CAACTGATCT TCAGCATCTT 
4261 TTACTTTCAC CAGCGTTTCT GGGTGAGCAA AAACAGGAAG GCAAAATGCC GCAAAAAAGG 
4321 GAATAAGGGC GACACGGAAA TGTTGAATAC TCATACTCTT CCTTTTTCAA TATTATTGAA 
4381 GCATTTATCA GGGTTATTGT CTCATGAGCG GATACATATT TGAATGTATT TAGAAAAATA 
4441 AACAAATAGG GGTTCCGCGC ACATTTCCCC GAAAAGTGCC ACCTGACGTC TAAGAAACCA
4501 TTATTATCAT GACATTAACC TATAAAAATA GGCGTATCAC GAGGCCCTTT CGTCTCGCGC 
4561 GTTTCGGTGA TGACGGTGAA AACCTCTGAC ACATGCAGCT CCCGGAGACG GTCACAGCTT 
4621 CTGTCTAAGC GGATGCCGGG AGCAGACAAG CCCGTCAGGG CGCGTCAGCG GGTGTTGGCG 
4681 GGTGTCGGGG CTGGCTTAAC TATGCGGCAT CAGAGCAGAT TGTACTGAGA GTGCACCATA 
4741 TCGACGCTCT CCCTTATGCG ACTCCTGCAT TAGGAAGCAG CCCAGTACTA GGTTGAGGCC 
4801 GTTGAGCACC GCCGCCGCAA GGAATGGTGC ATGCAAGGAG ATGGCGCCCA ACAGTCCCCC 
4861 GGCCACGGGG CCTGCCACCA TACCCACGCC GAAACAAGCG CTCATGAGCC CGAAGTGGCG 
4921 AGCCCGATCT TCCCCATCGG TGATGTCGGC GATATAGGCG CCAGCAACCG CACCTGTGGC 
4981 GCCGGTGATG CCGGCCACGA TGCGTCCGGC GTAGAGGATC TGGCTAGCGA TGACCCTGCT 
5041 GATTGGTTCG CTGACCATTT CCGGGGTGCG GAACGGCGTT ACCAGAAACT CAGAAGGTTC 
5101 GTCCAACCAA ACCGACTCTG ACGGCAGTTT ACGAGAGAGA TGATAGGGTC TGCTTCAGTA 
5161 AGCCAGATGC TACACAATTA GGCTTGTACA TATTGTCGTT AGAACGCGGC TACAATTAAT 
5221 ACATAACCTT ATGTATCATA CACATACGAT TTAGGTGACA CTATAGATGG CGGATGTGTG 
5281 ACATACACGA CGCCAAAAGA TTTTGTTCCA GCTCCTGCCA CCTCCGCTAC GCGAGAGATT 
5341 AACCACCCAC GATGGCCGCC AAAGTGCATG TTGATATTGA GGCTGACAGC CCATTCATCA 
5401 AGTCTTTGCA GAAGGCATTT CCGTCGTTCG AGGTGGAGTC ATTGCAGGTC ACACCAAATG 
5461 ACCATGCAAA TGCCAGAGCA TTTTCGCACC TGGCTACCAA ATTGATCGAG CAGGAGACTG 
5521 ACAAAGACAC ACTCATCTTG GATATCGGCA GTGCGCCTTC CAGGAGAATG ATGTCTACGC 
5581 ACAAATACCA CTGCGTATGC CCTATGCGCA GCGCAGAAGA CCCCGAAAGG CTCGATAGCT 
5641 ACGCAAAGAA ACTGGCAGCG GCCTCCGGGA AGGTGCTGGA TAGAGAGATC GCAGGAAAAA 
5701 TCACCGACCT GCAGACCGTC ATGGCTACGC CAGACGCTGA ATCTCCTACC TTTTGCCTGC 
5761 ATACAGACGT CACGTGTCGT ACGGCAGCCG AAGTGGCCGT ATACCAGGAC GTGTATGCTG 
5821 TACATGCACC AACATCGCTG TACCATCAGG CGATGAAAGG TGTCAGAACG GCGTATTGGA 
5881 TTGGGTTTGA CACCACCCCG TTTATGTTTG ACGCGCTAGC AGGCGCGTAT CCAACCTACG 
5941 CCACAAACTG GGCCGACGAG CAGGTGTTAC AGGCCAGGAA CATAGGACTG TGTGCAGCAT 
6001 CCTTGACTGA GGGAAGACTC GGCAAACTGT CCATTCTCCG CAAGAAGCAA TTGAAACCTT 
6061 GCGACACAGT CATGTTCTCG GTAGGATCTA CATTGTACAC TGAGAGCAGA AAGCTACTGA 
6121 GGAGCTGGCA CTTACCCTCC GTATTCCACC TGAAAGGTAA ACAATCCTTT ACCTGTAGGT 
6181 GCGATACCAT CGTATCATGT GAAGGGTACG TAGTTAAGAA AATCACTATG TGCCCCGGCC 
6241 TGTACGGTAA AACGGTAGGG TACGCCGTGA CGTATCACGC GGAGGGATTC CTAGTGTGCA 
6301 AGACCACAGA CACTGTCAAA GGAGAAAGAG TCTCATTCCC TGTATGCACC TACGTCCCCT 
6361 CAACCATCTG TGATCAAATG ACTGGCATAC TAGCGACCGA CGTCACACCG GAGGACGCAC 
6421 AGAAGTTGTT AGTGGGATTG AATCAGAGGA TAGTTGTGAA CGGAAGAACA CAGCGAAACA 
6481 CTAACACGAT GAAGAACTAT CTGCTTCCGA TTGTGGCCGT CGCATTTAGC AAGTGGGCGA 
6541 GGGAATACAA GGCAGACCTT GATGATGAAA AACCTCTGGG TGTCCGAGAG AGGTCACTTA
6601 CTTGCTGCTG CTTGTGGGCA TTTAAAACGA GGAAGATGCA CACCATGTAC AAGAAACCAG 
6661 ACACCCAGAC AATAGTGAAG GTGCCTTCAG AGTTTAACTC GTTCGTCATC CCGAGCCTAT 
6721 GGTCTACAGG CCTCGCAATC CCAGTCAGAT CACGCATTAA GATGCTTTTG GCCAAGAAGA 
6781 CCAAGCGAGA GTTAATACCT GTTCTCGACG CGTCGTCAGC CAGGGATGCT GAACAAGAGG 
6841 AGAAGGAGAG GTTGGAGGCC GAGCTGACTA GAGAAGCCTT ACCACCCCTC GTCCCCATCG 
6901 CGCCGGCGGA GACGGGAGTC GTCGACGTCG ACGTTGAAGA ACTAGAGTAT CACGCAGGTG 
6961 CAGGGGTCGT GGAAACACCT CGCAGCGCGT TGAAAGTCAC CGCACAGCCG AACGACGTAC 
7021 TACTAGGAAA TTACGTAGTT CTGTCCCCGC AGACCGTGCT CAAGAGCTCC AAGTTGGCCC 
7081 CCGTGCACCC TCTAGCAGAG CAGGTGAAAA TAATAACACA TAACGGGAGG GCCGGCGGTT 
7141 ACCAGGTCGA CGGATATGAC GGCAGGGTCC TACTACCATG TGGATCGGCC ATTCCGGTCC 
7201 CTGAGTTTCA GGCTTTGAGC GAGAGCGCCA CTATGGTGTA CAACGAAAGG GAGTTCGTCA 
7261 ACAGGAAACT ATACCATATT GCCGTTCACG GACCCTCGCT GAACACCGAC GAGGAGAACT 
7321 ACGAGAAAGT CAGAGCTGAA AGAACTGACG CCGAGTACGT GTTCGACGTA GATAAAAAAT 
7381 GCTGCGTCAA GAGAGAGGAA GCGTCGGGTT TGGTGTTGGT GGGAGAGCTA ACCAACCCCC 
7441 CGTTCCATGA ATTCGCCTAC GAAGGGCTGA AGATCAGGCC GTCGGCACCA TATAAGACTA 
7501 CAGTAGTAGG AGTCTTTGGG GTTCCGGGAT CAGGCAAGTC TGCTATTATT AAGAGCCTCG 
7561 TGACCAAACA CGATCTGGTC ACCAGCGGCA AGAAGGAGAA CTGCCAGGAA ATAGTTAACG 
7621 ACGTGAAGAA GCACCGCGGG AAGGGGACAA GTAGGGAAAA CAGTGACTCC ATCCTGCTAA 
7681 ACGGGTGTCG TCGTGCCGTG GACATCCTAT ATGTGGACGA GGCTTTCGCT TGCCATTCCG 
7741 GTACTCTGCT GGCCCTAATT GCTCTTGTTA AACCTCGGAG CAAAGTGGTG TTATGCGGAG 
7801 ACCCCAAGCA ATGCGGATTC TTCAATATGA TGCAGCTTAA GGTGAACTTC AACCACAACA 
7861 TCTGCACTGA AGTATGTCAT AAAAGTATAT CCAGACGTTG CACGCGTCCA GTCACGGCCA 
7921 TCGTGTCTAC GTTGCACTAC GGAGGCAAGA TGCGCACGAC CAACCCGTGC AACAAACCCA 
7981 TAATCATAGA CACCACAGGA CAGACCAAGC CCAAGCCAGG AGACATCGTG TTAACATGCT 
8041 TCCGAGGCTG GGCAAAGCAG CTGCAGTTGG ACTACCGTGG ACACGAAGTC ATGACAGCAG 
8101 CAGCATCTCA GGGCCTCACC CGCAAAGGGG TATACGCCGT AAGGCAGAAG GTGAATGAAA 
8161 ATCCCTTGTA TGCCCCTGCG TCGGAGCACG TGAATGTACT GCTGACGCGC ACTGAGGATA 
8221 GGCTGGTGTG GAAAACGCTG GCCGGCGATC CCTGGATTAA GGTCCTATCA AACATTCCAC 
8281 AGGGTAACTT TACGGCCACA TTGGAAGAAT GGCAAGAAGA ACACGACAAA ATAATGAAGG 
8341 TGATTGAAGG ACCGGCTGCG CCTGTGGACG CGTTCCAGAA CAAAGCGAAC GTGTGTTGGG 
8401 CGAAAAGCCT GGTGCCTGTC CTGGACACTG CCGGAATCAG ATTGACAGCA GAGGAGTGGA 
8461 GCACCATAAT TACAGCATTT AAGGAGGACA GAGCTTACTC TCCAGTGGTG GCCTTGAATG 
8521 AAATTTGCAC CAAGTACTAT GGAGTTGACC TGGACAGTGG CCTGTTTTCT GCCCCGAAGG 
8581 TGTCCCTGTA TTACGAGAAC AACCACTGGG ATAACAGACC TGGTGGAAGG ATGTATGGAT 
8641 TCAATGCCGC AACAGCTGCC AGGCTGGAAG CTAGACATAC CTTCCTGAAG GGGCAGTGGC 
8701 ATACGGGCAA GCAGGCAGTT ATCGCAGAAA GAAAAATCCA ACCGCTTTCT GTGCTGGACA 
8761 ATGTAATTCC TATCAACCGC AGGCTGCCGC ACGCCCTGGT GGCTGAGTAC AAGACGGTTA 
8821 AAGGCAGTAG GGTTGAGTGG CTGGTCAATA AAGTAAGAGG GTACCACGTC CTGCTGGTGA 
8881 GTGAGTACAA CCTGGCTTTG CCTCGACGCA GGGTCACTTG GTTGTCACCG CTGAATGTCA 
8941 CAGGCGCCGA TAGGTGCTAC GACCTAAGTT TAGGACTGCC GGCTGACGCC GGCAGGTTCG 
9001 ACTTGGTCTT TGTGAACATT CACACGGAAT TCAGAATCCA CCACTACCAG CAGTGTGTCG
9061 ACCACGCCAT GAAGCTGCAG ATGCTTGGGG GAGATGCGCT ACGACTGCTA AAACCCGGCG
9121 GCATCTTGAT GAGAGCTTAC GGATACGCCG ATAAAATCAG CGAAGCCGTT GTTTCCTCCT
9181 TAAGCAGAAA GTTCTCGTCT GCAAGAGTGT TGCGCCCGGA TTGTGTCACC AGCAATACAG 
9241 AAGTGTTCTT GCTGTTCTCC AACTTTGACA ACGGAAAGAG ACCCTCTACG CTACACCAGA 
9301 TGAATACCAA GCTGAGTGCC GTGTATGCCG GAGAAGCCAT GCACACGGCC GGGTGTGCAC 
9361 CATCCTACAG AGTTAAGAGA GCAGACATAG CCACGTGCAC AGAAGCGGCT GTGGTTAACG 
9421 CAGCTAACGC CCGTGGAACT GTAGGGGATG GCGTATGCAG GGCCGTGGCG AAGAAATGGC 
9481 CGTCAGCCTT TAAGGGAGCA GCAACACCAG TGGGCACAAT TAAAACAGTC ATGTGCGGCT 
9541 CGTACCCCGT CATCCACGCT GTAGCGCCTA ATTTCTCTGC CACGACTGAA GCGGAAGGGG 
9601 ACCGCGAATT GGCCGCTGTC TACCGGGCAG TGGCCGCCGA AGTAAACAGA CTGTCACTGA 
9661 GCAGCGTAGC CATCCCGCTG CTGTCCACAG GAGTGTTCAG CGGCGGAAGA GATAGGCTGC 
9721 AGCAATCCCT CAACCATCTA TTCACAGCAA TGGACGCCAC GGACGCTGAC GTGACCATCT 
9781 ACTGCAGAGA CAAAAGTTGG GAGAAGAAAA TCCAGGAAGC CATTGACATG AGGACGGCTG 
9841 TGGAGTTGCT CAATGATGAC GTGGAGCTGA CCACAGACTT GGTGAGAGTG CACCCGGACA 
9901 GCAGCCTGGT GGGTCGTAAG GGCTACAGTA CCACTGACGG GTCGCTGTAC TCGTACTTTG 
9961 AAGGTACGAA ATTCAACCAG GCTGCTATTG ATATGGCAGA GATACTGACG TTGTGGCCCA 
10021 GACTGCAAGA GGCAAACGAA CAGATATGCC TATACGCGCT GGGCGAAACA ATGGACAACA 
10081 TCAGATCCAA ATGTCCGGTG AACGATTCCG ATTCATCAAC ACCTCCCAGG ACAGTGCCCT 
10141 GCCTGTGCCG CTACGCAATG ACAGCAGAAC GGATCGCCCG CCTTAGGTCA CACCAAGTTA 
10201 AAAGCATGGT GGTTTGCTCA TCTTTTCCCC TCCCGAAATA CCATGTAGAT GGGGTGCAGA 
10261 AGGTAAAGTG CGAGAAGGTT CTCCTGTTCG ACCCGACGGT ACCTTCAGTG GTTAGTCCGC 
10321 GGAAGTATGC CGCATCTACG ACGGACCACT CAGATCGGTC GTTACGAGGG TTTGACTTGG 
10381 ACTGGACCAC CGACTCGTCT TCCACTGCCA GCGATACCAT GTCGCTACCC AGTTTGCAGT 
10441 CGTGTGACAT CGACTCGATC TACGAGCCAA TGGCTCCCAT AGTAGTGACG GCTGACGTAC 
10501 ACCCTGAACC CGCAGGCATC GCGGACCTGG CGGCAGATGT GCACCCTGAA CCCGCAGACC 
10561 ATGTGGACCT GGAGAACCCG ATTCCTCCAC CGCGCCCGAA GAGAGCTGCA TACCTTGCCT 
10621 CCCGCGCGGC GGAGCGACCG GTGCCGGCGC CGAGAAAGCC GACGCCTGCC CCAAGGACTG 
10681 CGTTTAGGAA CAAGCTGCCT TTGACGTTCG GCGACTTTGA CGAGCACGAG GTCGATGCGT 
10741 TGGCCTCCGG GATTACTTTC GGAGACTTCG ACGACGTCCT GCGACTAGGC CGCGCGGGTG 
10801 CATATATTTT CTCCTCGGAC ACTGGCAGCG GACATTTACA ACAAAAATCC GTTAGGCAGC 
10861 ACAATCTCCA GTGCGCACAA CTGGATGCGG TCCAGGAGGA GAAAATGTAC CCGCCAAAAT 
10921 TGGATACTGA GAGGGAGAAG CTGTTGCTGC TGAAAATGCA GATGCACCCA TCGGAGGCTA 
10981 ATAAGAGTCG ATACCAGTCT CGCAAAGTGG AGAACATGAA AGCCACGGTG GTGGACAGGC 
11041 TCACATCGGG GGCCAGATTG TACACGGGAG CGGACGTAGG CCGCATACCA ACATACGCGG 
11101 TTCGGTACCC CCGCCCCGTG TACTCCCCTA CCGTGATCGA AAGATTCTCA AGCCCCGATG 
11161 TAGCAATCGC AGCGTGCAAC GAATACCTAT CCAGAAATTA CCCAACAGTG GCGTCGTACC 
11221 AGATAACAGA TGAATACGAC GCATACTTGG ACATGGTTGA CGGGTCGGAT AGTTGCTTGG 
11281 ACAGAGCGAC ATTCTGCCCG GCGAAGCTCC GGTGCTACCC GAAACATCAT GCGTACCACC 
11341 AGCCGACTGT ACGCAGTGCC GTCCCGTCAC CCTTTCAGAA CACACTACAG AACGTGCTAG 
11401 CGGCTGCCAC CAAGAGAAAC TGCAACGTCA CGCAAATGCG AGAACTACCC ACCATGGACT 
11461 CGGCAGTGTT CAACGTGGAG TGCTTCAAGC GCTATGCCTG CTCCGGAGAA TATTGGGAAG
11521 AATATGCTAA ACAACCTATC CGGATAACCA CTGAGAACAT CACTACCTAT GTGACCAAAT 
11581 TGAAAGGCCC GAAAGCTGCT GCCTTGTTCG CTAAGACCCA CAACTTGGTT CCGCTGCAGG 
11641 AGGTTCCCAT GGACAGATTC ACGGTCGACA TGAAACGAGA TGTCAAAGTC ACTCCAGGGA 
11701 CGAAACACAC AGAGGAAAGA CCCAAAGTCC AGGTAATTCA AGCAGCGGAG CCATTGGCGA 
11761 CCGCTTACCT GTGCGGCATC CACAGGGAAT TAGTAAGGAG ACTAAATGCT GTGTTACGCC 
11821 CTAACGTGCA CACATTGTTT GATATGTCGG CCGAAGACTT TGACGCGATC ATCGCCTCTC 
11881 ACTTCCACCC AGGAGACCCG GTTCTAGAGA CGGACATTGC ATCATTCGAC AAAAGCCAGG
11941 ACGACTCCTT GGCTCTTACA GGTTTAATGA TCCTCGAAGA TCTAGGGGTG GATCAGTACC 
12001 TGCTGGACTT GATCGAGGCA GCCTTTGGGG AAATATCCAG CTGTCACCTA CCAACTGGCA 
12061 CGCGCTTCAA GTTCGGAGCT ATGATGAAAT CGGGCATGTT TCTGACTTTG TTTATTAACA 
12121 CTGTTTTGAA CATCACCATA GCAAGCAGGG TACTGGAGCA GAGACTCACT GACTCCGCCT 
12181 GTGCGGCCTT CATCGGCGAC GACAACATCG TTCACGGAGT GATCTCCGAC AAGCTGATGG 
12241 CGGAGAGGTG CGCGTCGTGG GTCAACATGG AGGTGAAGAT CATTGACGCT GTCATGGGCG 
12301 AAAAACCCCC ATATTTTTGT GGGGGATTCA TAGTTTTTGA CAGCGTCACA CAGACCGCCT 
12361 GCCGTGTTTC AGACCCACTT AAGCGCCTGT TCAAGTTGGG TAAGCCGCTA ACAGCTGAAG 
12421 ACAAGCAGGA CGAAGACAGG CGACGAGCAC TGAGTGACGA GGTT 
pDEST10 Polyhedrin Promoter with N-His6, Baculovirus Transfer Plasmid
pDEST10 6708 bp

Location (Base Nos.)    Gene Encoded
23..152                 Ppolh
461..337                attR1
711..1370               CmR
1490..1574              inactivated ccdA
1712..2017              ccdB
2058..2182              attR2
3394..4369              ampR
4510..5164              ori
5658..62                genR

1 CCCCGGATGA AGTGGTTCGC ATCCTCGGTT TTCTGGAAGG CGAGCATCGT TTGTTCGCCC 
61 AGGACTCTAG CTATAGTTCT AGTGGTTGGC TACGTATACT CCGGAATATT AATAGATCAT 
121 GGAGATAATT AAAATGATAA CCATCTCGCA AATAAATAAG TATTTTACTG TTTTCGTAAC 
181 AGTTTTGTAA TAAAAAAACC TATAAATATT CCGGATTATT CATACCGTCC CACCATCGGG 
241 CGCGGATCTC GGTCCGAAAC CATGTCGTAC TACCATCACC ATCACCATCA CGATTACGAT 
301 ATCCCAACGA CCGAAAACCT GTATTTTCAG GGCATCACAA GTTTGTACAA AAAAGCTGAA 
361 CGAGAAACGT AAAATGATAT AAATATCAAT ATATTAAATT AGATTTTGCA TAAAAAACAG 
421 ACTACATAAT ACTGTAAAAC ACAACATATC CAGTCACTAT GGCGGCCGCT AAGTTGGCAG 
481 CATCACCCGA CGCACTTTGC GCCGAATAAA TACCTGTGAC GGAAGATCAC TTCGCAGAAT 
541 AAATAAATCC TGGTGTCCCT GTTGATACCG GGAAGCCCTG GGCCAACTTT TGGCGAAAAT 
601 GAGACGTTGA TCGGCACGTA AGAGGTTCCA ACTTTCACCA TAATGAAATA AGATCACTAC 
661 CGGGCGTATT TTTTGAGTTA TCGAGATTTT CAGGAGCTAA GGAAGCTAAA ATGGAGAAAA 
721 AAATCACTGG ATATACCACC GTTGATATAT CCCAATGGCA TCGTAAAGAA CATTTTGAGG 
781 CATTTCAGTC AGTTGCTCAA TGTACCTATA ACCAGACCGT TCAGCTGGAT ATTACGGCCT
841 TTTTAAAGAC CGTAAAGAAA AATAAGCACA AGTTTTATCC GGCCTTTATT CACATTCTTG 
901 CCCGCCTGAT GAATGCTCAT CCGGAATTCC GTATGGCAAT GAAAGACGGT GAGCTGGTGA 
961 TATGGGATAG TGTTCACCCT TGTTACACCG TTTTCCATGA GCAAACTGAA ACGTTTTCAT 
1021 CGCTCTGGAG TGAATACCAC GACGATTTCC GGCAGTTTCT ACACATATAT TCGCAAGATG 
1081 TGGCGTGTTA CGGTGAAAAC CTGGCCTATT TCCCTAAAGG GTTTATTGAG AATATGTTTT 
1141 TCGTCTCAGC CAATCCCTGG GTGAGTTTCA CCAGTTTTGA TTTAAACGTG GCCAATATGG 
1201 ACAACTTCTT CGCCCCCGTT TTCACCATGG GCAAATATTA TACGCAAGGC GACAAGGTGC 
1261 TGATGCCGCT GGCGATTCAG GTTCATCATG CCGTCTGTGA TGGCTTCCAT GTCGGCAGAA 
1321 TGCTTAATGA ATTACAACAG TACTGCGATG AGTGGCAGGG CGGGGCGTAA ACGCGTGGAT 
1381 CCGGCTTACT AAAAGCCAGA TAACAGTATG CGTATTTGCG CGCTGATTTT TGCGGTATAA 
1441 GAATATATAC TGATATGTAT ACCCGAAGTA TGTCAAAAAG AGGTGTGCTA TGAAGCAGCG 
1501 TATTACAGTG ACAGTTGACA GCGACAGCTA TCAGTTGCTC AAGGCATATA TGATGTCAAT 
1561 ATCTCCGGTC TGGTAAGCAC AACCATGCAG AATGAAGCCC GTCGTCTGCG TGCCGAACGC 
1621 TGGAAAGCGG AAAATCAGGA AGGGATGGCT GAGGTCGCCC GGTTTATTGA AATGAACGGC
1681 TCTTTTGCTG ACGAGAACAG GGACTGGTGA AATGCAGTTT AAGGTTTACA CCTATAAAAG 
1741 AGAGAGCCGT TATCGTCTGT TTGTGGATGT ACAGAGTGAT ATTATTGACA CGCCCGGGCG 
1801 ACGGATGGTG ATCCCCCTGG CCAGTGCACG TCTGCTGTCA GATAAAGTCT CCCGTGAACT
1861 TTACCCGGTG GTGCATATCG GGGATGAAAG CTGGCGCATG ATGACCACCG ATATGGCCAG 
1921 TGTGCCGGTC TCCGTTATCG GGGAAGAAGT GGCTGATCTC AGCCACCGCG AAAATGACAT 
1981 CAAAAACGCC ATTAACCTGA TGTTCTGGGG AATATAAATG TCAGGCTCCC TTATACACAG 
2041 CCAGTCTGCA GGTCGACCAT AGTGACTGGA TATGTTGTGT TTTACAGTAT TATGTAGTCT
2101 GTTTTTTATG CAAAATCTAA TTTAATATAT TGATATTTAT ATCATTTTAC GTTTCTCGTT
2161 CAGCTTTCTT GTACAAAGTG GTGATGCCAT GGATCCGGAA TTCAAAGGCC TACGTCGACG 
2221 AGCTCAACTA GTGCGGCCGC TTTCGAATCT AGAGCCTGCA GTCTCGAGGC ATGCGGTACC 
2281 AAGCTTGTCG AGAAGTACTA GAGGATCATA ATCAGCCATA CCACATTTGT AGAGGTTTTA 
2341 CTTGCTTTAA AAAACCTCCC ACACCTCCCC CTGAACCTGA AACATAAAAT GAATGCAATT 
2401 GTTGTTGTTA ACTTGTTTAT TGCAGCTTAT AATGGTTACA AATAAAGCAA TAGCATCACA 
2461 AATTTCACAA ATAAAGCATT TTTTTCACTG CATTCTAGTT GTGGTTTGTC CAAACTCATC 
2521 AATGTATCTT ATCATGTCTG GATCTGATCA CTGCTTGAGC CTAGGAGATC CGAACCAGAT 
2581 AAGTGAAATC TAGTTCCAAA CTATTTTGTC ATTTTTAATT TTCGTATTAG CTTACGACGC 
2641 TACACCCAGT TCCCATCTAT TTTGTCACTC TTCCCTAAAT AATCCTTAAA AACTCCATTT 
2701 CCACCCCTCC CAGTTCCCAA CTATTTTGTC CGCCCACAGC GGGGCATTTT TCTTCCTGTT
2761 ATGTTTTTAA TCAAACATCC TGCCAACTCC ATGTGACAAA CCGTCATCTT CGGCTACTTT 
2821 TTCTCTGTCA CAGAATGAAA ATTTTTCTGT CATCTCTTCG TTATTAATGT TTGTAATTGA 
2881 CTGAATATCA ACGCTTATTT GCAGCCTGAA TGGCGAATGG GACGCGCCCT GTAGCGGCGC 
2941 ATTAAGCGCG GCGGGTGTGG TGGTTACGCG CAGCGTGACC GCTACACTTG CCAGCGCCCT 
3001 AGCGCCCGCT CCTTTCGCTT TCTTCCCTTC CTTTCTCGCC ACGTTCGCCG GCTTTCCCCG 
3061 TCAAGCTCTA AATCGGGGGC TCCCTTTAGG GTTCCGATTT AGTGCTTTAC GGCACCTCGA 
3121 CCCCAAAAAA CTTGATTAGG GTGATGGTTC ACGTAGTGGG CCATCGCCCT GATAGACGGT
3181 TTTTCGCCCT TTGACGTTGG AGTCCACGTT CTTTAATAGT GGACTCTTGT TCCAAACTGG 
3241 AACAACACTC AACCCTATCT CGGTCTATTC TTTTGATTTA TAAGGGATTT TGCCGATTTC 
3301 GGCCTATTGG TTAAAAAATG AGCTGATTTA ACAAAAATTT AACGCGAATT TTAACAAAAT 
3361 ATTAACGTTT ACAATTTCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG 
3421 TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT
3481 GCTTCAATAA TATTGAAAAA GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT 
3541 TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT
3601 AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG 
3661 CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA 
3721 AGTTCTGCTA TGTGGCGCGG TATTATCCCG TATTGACGCC GGGCAAGAGC AACTCGGTCG 
3781 CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT 
3841 TACGGATGGC ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC 
3901 TGCGGCCAAC TTACTTCTGA CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA
3961 CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT 
4021 ACCAAACGAC GAGCGTGACA CCACGATGCC TGTAGCAATG GCAACAACGT TGCGCAAACT 
4081 ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT GGATGGAGGC 
4141 GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA 
4201 TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG 
4261 TAAGCCCTCC CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG
4321 AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA 
4381 AGTTTACTCA TATATACTTT AGATTGATTT AAAACTTCAT TTTTAATTTA AAAGGATCTA 
4441 GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA
4501 CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG 
4561 CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA 
4621 TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA 
4681 TACTGTCCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC 
4741 TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG 
4801 TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC 
4861 GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT 
4921 ACAGCGTGAG CATTGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC 
4981 GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG 
5041 GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG 
5101 CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT 
5161 GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA 
5221 TAACCGTATT ACCGCCTTTG AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG 
5281 CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCTGATG CGGTATTTTC TCCTTACGCA 
5341 TCTGTGCGGT ATTTCACACC GCAGACCAGC CGCGTAACCT GGCAAAATCG GTTACGGTTG 
5401 AGTAATAAAT GGATGCCCTG CGTAAGCGGG TGTGGGCGGA CAATAAAGTC TTAAACTGAA 
5461 CAAAATAGAT CTAAACTATG ACAATAAAGT CTTAAACTAG ACAGAATAGT TGTAAACTGA 
5521 AATCAGTCCA GTTATGCTGT GAAAAAGCAT ACTGGACTTT TGTTATGGCT AAAGCAAACT 
5581 CTTCATTTTC TGAAGTGCAA ATTGCCCGTC GTATTAAAGA GGGGCGTGGC CAAGGGCATG 
5641 GTAAAGACTA TATTCGCGGC GTTGTGACAA TTTACCGAAC AACTCCGCGG CCGGGAAGCC 
5701 GATCTCGGCT TGAACGAATT GTTAGGTGGC GGTACTTGGG TCGATATCAA AGTGCATCAC 
5761 TTCTTCCCGT ATGCCCAACT TTGTATAGAG AGCCACTGCG GGATCGTCAC CGTAATCTGC 
5821 TTGCACGTAG ATCACATAAG CACCAAGCGC GTTGGCCTCA TGCTTGAGGA GATTGATGAG 
5881 CGCGGTGGCA ATGCCCTGCC TCCGGTGCTC GCCGGAGACT GCGAGATCAT AGATATAGAT 
5941 CTCACTACGC GGCTGCTCAA ACCTGGGCAG AACGTAAGCC GCGAGAGCGC CAACAACCGC 
6001 TTCTTGGTCG AAGGCAGCAA GCGCGATGAA TGTCTTACTA CGGAGCAAGT TCCCGAGGTA 
6061 ATCGGAGTCC GGCTGATGTT GGGAGTAGGT GGCTACGTCT CCGAACTCAC GACCGAAAAG 
6121 ATCAAGAGCA GCCCGCATGG ATTTGACTTG GTCAGGGCCG AGCCTACATG TGCGAATGAT 
6181 GCCCATACTT GAGCCACCTA ACTTTGTTTT AGGGCGACTG CCCTGCTGCG TAACATCGTT 
6241 GCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCAAACA TCGACCCACG GCGTAACGCG 
6301 CTTGCTGCTT GGATGCCCGA GGCATAGACT GTACAAAAAA ACAGTCATAA CAAGCCATGA 
6361 AAACCGCCAC TGCGCCGTTA CCACCGCTGC GTTCGGTCAA GGTTCTGGAC CAGTTGCGTG 
6421 AGCGCATACG CTACTTGCAT TACAGTTTAC GAACCGAACA GGCTTATGTC AACTGGGTTC 
6481 GTGCCTTCAT CCGTTTCCAC GGTGTGCGTC ACCCGGCAAC CTTGGGCAGC AGCGAAGTCG 
6541 AGGCATTTCT GTCCTGGCTG GCGAACGAGC GCAAGGTTTC GGTCTCCACG CATCGTCAGG 
6601 CATTGGCGGC CTTGCTGTTC TTCTACGGCA AGGTGCTGTG CACGGATCTG CCCTGGCTTC 
6661 AGGAGATCGG AAGACCTCGG CCGTCGCGGC GCTTGCCGGT GGTGCTGA 
mRNA FROM CMV PROMOTER (CONTROLLED by TETRACYCLINE)
358 tag tga acc g|tc aga tcg cct gga gac gcc atc cac gct gtt ttg acc tcc
atc act tgg cag tct agc gga cct ctg cgg tag gtg cga caa aac tgg agg

409 ata gaa gac acc ggg acc gat cca gcc tcc gcg gcc ccg aat tcg agc tcg
tat ctt ctg tgg ccc tgg cta ggt cgg agg cgc cgg ggc tta agc tcg agc
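In the two-strand display above, the second line of each pair is the complementary strand of the first, written base for base in the same left-to-right order. A minimal Python sketch of that relationship follows; the function name `complement` and the lookup table `PAIRS` are illustrative, and the `top` string is an assumed reading of the printed strand starting at base 409, not an authoritative transcription.

```python
# Watson-Crick pairing for lowercase DNA letters (illustrative helper).
PAIRS = {"a": "t", "t": "a", "c": "g", "g": "c"}


def complement(strand: str) -> str:
    """Return the base-by-base complement; spaces and other marks pass through."""
    return "".join(PAIRS.get(base, base) for base in strand)


# Assumed reading of the upper strand starting at base 409.
top = "ata gaa gac acc ggg acc gat cca gcc tcc gcg gcc ccg aat tcg agc tcg"
bottom = complement(top)
print(bottom)
```

Comparing the printed result with the lower strand in the figure is a quick way to spot character-recognition errors in either line.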
pDEST11 7026 bp

Location (Base Nos.)    Gene Encoded
4..479                  Tetp ((Tet operator)7 and min hCMV promoter)
638..514                attR1
888..1547               CmR
1667..1751              inactivated ccdA
1889..2194              ccdB
2235..2359              attR2
2402..4132              polyA
4347..4803              f1 ori
4940..5797              ampR

1 CGAGTTTACC ACTCCCTATC AGTGATAGAG AAAAGTGAAA GTCGAGTTTA CCACTCCCTA 
61 TCAGTGATAG AGAAAAGTGA AAGTCGAGTT TACCACTCCC TATCAGTGAT AGAGAAAAGT 
121 GAAAGTCGAG TTTACCACTC CCTATCAGTG ATAGAGAAAA GTGAAAGTCG AGTTTACCAC
181 TCCCTATCAG TGATAGAGAA AAGTGAAAGT CGAGTTTACC ACTCCCTATC AGTGATAGAG 
241 AAAAGTGAAA GTCGAGTTTA CCACTCCCTA TCAGTGATAG AGAAAAGTGA AAGTCGAGCT 
301 CGGTACCCGG GTCGAGTAGG CGTGTACGGT GGGAGGCCTA TATAAGCAGA GCTCGTTTAG 
361 TGAACCGTCA GATCGCCTGG AGACGCCATC CACGCTGTTT TGACCTCCAT AGAAGACACC 
421 GGGACCGATC CAGCCTCCGC GGCCCCGAAT TCGAGCTCGG TACCCGGGGA TCCTCTAGAG 
481 TCGAGGTCGA CGGTATCGAT AAGCTTGATA TCAACAAGTT TGTACAAAAA AGCTGAACGA 
541 GAAACGTAAA ATGATATAAA TATCAATATA TTAAATTAGA TTTTGCATAA AAAACAGACT 
601 ACATAATACT GTAAAACACA ACATATCCAG TCACTATGGC GGCCGCTAAG TTGGCAGCAT 
661 CACCCGACGC ACTTTGCGCC GAATAAATAC CTGTGACGGA AGATCACTTC GCAGAATAAA 
721 TAAATCCTGG TGTCCCTGTT GATACCGGGA AGCCCTGGGC CAACTTTTGG CGAAAATGAG 
781 ACGTTGATCG GCACGTAAGA GGTTCCAACT TTCACCATAA TGAAATAAGA TCACTACCGG
841 GCGTATTTTT TGAGTTATCG AGATTTTCAG GAGCTAAGGA AGCTAAAATG GAGAAAAAAA 
901 TCACTGGATA TACCACCGTT GATATATCCC AATGGCATCG TAAAGAACAT TTTGAGGCAT 
961 TTCAGTCAGT TGCTCAATGT ACCTATAACC AGACCGTTCA GCTGGATATT ACGGCCTTTT 
1021 TAAAGACCGT AAAGAAAAAT AAGCACAAGT TTTATCCGGC CTTTATTCAC ATTCTTGCCC 
1081 GCCTGATGAA TGCTCATCCG GAATTCCGTA TGGCAATGAA AGACGGTGAG CTGGTGATAT 
1141 GGGATAGTGT TCACCCTTGT TACACCGTTT TCCATGAGCA AACTGAAACG TTTTCATCGC 
1201 TCTGGAGTGA ATACCACGAC GATTTCCGGC AGTTTCTACA CATATATTCG CAAGATGTGG 
1261 CGTGTTACGG TGAAAACCTG GCCTATTTCC CTAAAGGGTT TATTGAGAAT ATGTTTTTCG 
1321 TCTCAGCCAA TCCCTGGGTG AGTTTCACCA GTTTTGATTT AAACGTGGCC AATATGGACA 
1381 ACTTCTTCGC CCCCGTTTTC ACCATGGGCA AATATTATAC GCAAGGCGAC AAGGTGCTGA 
1441 TGCCGCTGGC GATTCAGGTT CATCATGCCG TCTGTGATGG CTTCCATGTC GGCAGAATGC 
1501 TTAATGAATT ACAACAGTAC TGCGATGAGT GGCAGGGCGG GGCGTAAAGA TCTGGATCCG 
1561 GCTTACTAAA AGCCAGATAA CAGTATGCGT ATTTGCGCGC TGATTTTTGC GGTATAAGAA 
1621 TATATACTGA TATGTATACC CGAAGTATGT CAAAAAGAGG TGTGCTATGA AGCAGCGTAT
1681 TACAGTGACA GTTGACAGCG ACAGCTATCA GTTGCTCAAG GCATATATGA TGTCAATATC
1741 TCCGGTCTGG TAAGCACAAC CATGCAGAAT GAAGCCCGTC GTCTGCGTGC CGAACGCTGG
1801 AAAGCGGAAA ATCAGGAAGG GATGGCTGAG GTCGCCCGGT TTATTGAAAT GAACGGCTCT
1861 TTTGCTGACG AGAACAGGGA CTGGTGAAAT GCAGTTTAAG GTTTACACCT ATAAAAGAGA
1921 GAGCCGTTAT CGTCTGTTTG TGGATGTACA GAGTGATATT ATTGACACGC CCGGGCGACG
1981 GATGGTGATC CCCCTGGCCA GTGCACGTCT GCTGTCAGAT AAAGTCTCCC GTGAACTTTA
2041 CCCGGTGGTG CATATCGGGG ATGAAAGCTG GCGCATGATG ACCACCGATA TGGCCAGTGT
2101 GCCGGTCTCC GTTATCGGGG AAGAAGTGGC TGATCTCAGC CACCGCGAAA ATGACATCAA
2161 AAACGCCATT AACCTGATGT TCTGGGGAAT ATAAATGTCA GGCTCCCTTA TACACAGCCA
2221 GTCTGCAGGT CGACCATAGT GACTGGATAT GTTGTGTTTT ACAGTATTAT GTAGTCTGTT
2281 TTTTATGCAA AATCTAATTT AATATATTGA TATTTATATC ATTTTACGTT TCTCGTTCAG
2341 CTTTCTTGTA CAAAGTGGTT GATATCGAAT TCCTGCAGCC CGGGGGATCC ACTAGTTCTA
2401 GAGCACTGCG ATGAGTGGCA GGGCGGGGCG TAATTTTTTT AAGGCAGTTA TTGGTGCCCT
2461 TAAACGCCTG GTGCTACGCC TGAATAAGTG ATAATAAGCG GATGAATGGC AGAAATTCGC
2521 CGGATCTTTG TGAAGGAACC TTACTTCTGT GGTGTGACAT AATTGGACAA ACTACCTACA
2581 GAGATTTAAA GCTCTAAGGT AAATATAAAA TTTTTAAGTG TATAATGTGT TAAACTACTG
2641 ATTCTAATTG TTTGTGTATT TTAGATTCCA ACCTATGGAA CTGATGAATG GGAGCAGTGG
2701 TGGAATGCCT TTAATGAGGA AAACCTGTTT TGCTCAGAAG AAATGCCATC TAGTGATGAT
2761 GAGGCTACTG CTGACTCTCA ACATTCTACT CCTCCAAAAA AGAAGAGAAA GGTAGAAGAC
2821 CCCAAGGACT TTCCTTCAGA ATTGCTAAGT TTTTTGAGTC ATGCTGTGTT TAGTAATAGA
2881 ACTCTTGCTT GCTTTGCTAT TTACACCACA AAGGAAAAAG CTGCACTGCT ATACAAGAAA
2941 ATTATGGAAA AATATTCTGT AACCTTTATA AGTAGGCATA ACAGTTATAA TCATAACATA
3001 CTGTTTTTTC TTACTCCACA CAGGCATAGA GTGTCTGCTA TTAATAACTA TGCTCAAAAA
3061 TTGTGTACCT TTAGCTTTTT AATTTGTAAA GGGGTTAATA AGGAATATTT GATGTATAGT
3121 GCCTTGACTA GAGATCATAA TCAGCCATAC CACATTTGTA GAGGTTTTAC TTGCTTTAAA
3181 AAACCTCCCA CACCTCCCCC TGAACCTGAA ACATAAAATG AATGCAATTG TTGTTGTTAA
3241 CTTGTTTATT GCAGCTTATA ATGGTTACAA ATAAAGCAAT AGCATCACAA ATTTCACAAA
3301 TAAAGCATTT TTTTCACTGC ATTCTAGTTG TGGTTTGTCC AAACTCATCA ATGTATCTTA
3361 TCATGTCTGG ATCCCCAGGA AGCTCCTCTG TGTCCTCATA AACCCTAACC TCCTCTACTT
3421 GAGAGGACAT TCCAATCATA GGCTGCCCAT CCACCCTCTG TGTCCTCCTG TTAATTAGGT
3481 CACTTAACAA AAAGGAAATT GGGTAGGGGT TTTTCACAGA CCGCTTTCTA AGGGTAATTT
3541 TAAAATATCT GGGAAGTCCC TTCCACTGCT GTGTTCCAGA AGTGTTGGTA AACAGCCCAC
3601 AAATGTCAAC AGCAGAAACA TACAAGCTGT CAGCTTTGCA CAAGGGCCCA ACACCCTGCT
3661 CATCAAGAAG CACTGTGGTT GCTGTGTTAG TAATGTGCAA AACAGGAGGC ACATTTTCCC
3721 CACCTGTGTA GGTTCCAAAA TATCTAGTGT TTTCATTTTT ACTTGGATCA GGAACCCAGC
3781 ACTCCACTGG ATAAGCATTA TCCTTATCCA AAACAGCCTT GTGGTCAGTG TTCATCTGCT
3841 GACTGTCAAC TGTAGCATTT TTTGGGGTTA CAGTTTGAGC AGGATATTTG GTCCTGTAGT
3901 TTGCTAACAC ACCCTGCAGC TCCAAAGGTT CCCCACCAAC AGCAAAAAAA TGAAAATTTG
3961 ACCCTTGAAT GGGTTTTCCA GCACCATTTT CATGAGTTTT TTGTGTCCCT GAATGCAAGT
4021 TTAACATAGC AGTTACCCCA ATAACCTCAG TTTTAACAGT AACAGCTTCC CACATCAAAA
4081 TATTTCCACA GGTTAAGTCC TCATTTAAAT TAGGCAAAGG AATTGCTCTA GAGCGGCCGC
4141 CACCGCGGTG GAGCTCCAAT TCGCCCTATA GTGAGTCGTA TTACGCGCGC TCACTGGCCG
4201 TCGTTTTACA ACGTCGTGAC TGGGAAAACC CTGGCGTTAC CCAACTTAAT CGCCTTGCAG
4261 CACATCCCCC TTTCGCCAGC TGGCGTAATA GCGAAGAGGC CCGCACCGAT CGCCCTTCCC
4321 AACAGTTGCG CAGCCTGAAT GGCGAATGGG ACGCGCCCTG TAGCGGCGCA TTAAGCGCGG
4381 CGGGTGTGGT GGTTACGCGC AGCGTGACCG CTACACTTGC CAGCGCCCTA GCGCCCGCTC
4441 CTTTCGCTTT CTTCCCTTCC TTTCTCGCCA CGTTCGCCGG CTTTCCCCGT CAAGCTCTAA
4501 ATCGGGGGCT CCCTTTAGGG TTCCGATTTA GTGCTTTACG GCACCTCGAC CCCAAAAAAC
4561 TTGATTAGGG TGATGGTTCA CGTAGTGGGC CATCGCCCTG ATAGACGGTT TTTCGCCCTT
4621 TGACGTTGGA GTCCACGTTC TTTAATAGTG GACTCTTGTT CCAAACTGGA ACAACACTCA
4681 ACCCTATCTC GGTCTATTCT TTTGATTTAT AAGGGATTTT GCCGATTTCG GCCTATTGGT
4741 TAAAAAATGA GCTGATTTAA CAAAAATTTA ACGCGAATTT TAACAAAATA TTAACGCTTA
4801 CAATTTAGGT GGCACTTTTC GGGGAAATGT GCGCGGAACC CCTATTTGTT TATTTTTCTA
4861 AATACATTCA AATATGTATC CGCTCATGAG ACAATAACCC TGATAAATGC TTCAATAATA
4921 TTGAAAAAGG AAGAGTATGA GTATTCAACA TTTCCGTGTC GCCCTTATTC CCTTTTTTGC
4981 GGCATTTTGC CTTCCTGTTT TTGCTCACCC AGAAACGCTG GTGAAAGTAA AAGATGCTGA
5041 AGATCAGTTG GGTGCACGAG TGGGTTACAT CGAACTGGAT CTCAACAGCG GTAAGATCCT
5101 TGAGAGTTTT CGCCCCGAAG AACGTTTTCC AATGATGAGC ACTTTTAAAG TTCTGCTATG
5161 TGGCGCGGTA TTATCCCGTA TTGACGCCGG GCAAGAGCAA CTCGGTCGCC GCATACACTA
5221 TTCTCAGAAT GACTTGGTTG AGTACTCACC AGTCACAGAA AAGCATCTTA CGGATGGCAT
5281 GACAGTAAGA GAATTATGCA GTGCTGCCAT AACCATGAGT GATAACACTG CGGCCAACTT
5341 ACTTCTGACA ACGATCGGAG GACCGAAGGA GCTAACCGCT TTTTTGCACA ACATGGGGGA
5401 TCATGTAACT CGCCTTGATC GTTGGGAACC GGAGCTGAAT GAAGCCATAC CAAACGACGA
5461 GCGTGACACC ACGATGCCTG TAGCAATGGC AACAACGTTG CGCAAACTAT TAACTGGCGA
5521 ACTACTTACT CTAGCTTCCC GGCAACAATT AATAGACTGG ATGGAGGCGG ATAAAGTTGC
5581 AGGACCACTT CTGCGCTCGG CCCTTCCGGC TGGCTGGTTT ATTGCTGATA AATCTGGAGC
5641 CGGTGAGCGT GGGTCTCGCG GTATCATTGC AGCACTGGGG CCAGATGGTA AGCCCTCCCG
5701 TATCGTAGTT ATCTACACGA CGGGGAGTCA GGCAACTATG GATGAACGAA ATAGACAGAT
5761 CGCTGAGATA GGTGCCTCAC TGATTAAGCA TTGGTAACTG TCAGACCAAG TTTACTCATA
5821 TATACTTTAG ATTGATTTAA AACTTCATTT TTAATTTAAA AGGATCTAGG TGAAGATCCT
5881 TTTTGATAAT CTCATGACCA AAATCCCTTA ACGTGAGTTT TCGTTCCACT GAGCGTCAGA
5941 CCCCGTAGAA AAGATCAAAG GATCTTCTTG AGATCCTTTT TTTCTGCGCG TAATCTGCTG
6001 CTTGCAAACA AAAAAACCAC CGCTACCAGC GGTGGTTTGT TTGCCGGATC AAGAGCTACC
6061 AACTCTTTTT CCGAAGGTAA CTGGCTTCAG CAGAGCGCAG ATACCAAATA CTGTCCTTCT
6121 AGTGTAGCCG TAGTTAGGCC ACCACTTCAA GAACTCTGTA GCACCGCCTA CATACCTCGC
6181 TCTGCTAATC CTGTTACCAG TGGCTGCTGC CAGTGGCGAT AAGTCGTGTC TTACCGGGTT
6241 GGACTCAAGA CGATAGTTAC GGGATAAGGC GCAGCGGTCG GGCTGAACGG GGGGTTCGTG
6301 CACACAGCCC AGCTTGGAGC GAACGACCTA CACCGAACTG AGATACCTAC AGCGTGAGCT
6361 ATGAGAAAGC GCCACGCTTC CCGAAGGGAG AAAGGCGGAC AGGTATCCGG TAAGCGGCAG
6421 GGTCGGAACA GGAGAGCGCA CGAGGGAGCT TCCAGGGGGA AACGCCTGGT ATCTTTATAG
6481 TCCTGTCGGG TTTCGCCACC TCTGACTTGA GCGTCGATTT TTGTGATGCT CGTCAGGGGG
6541 GCGGAGCCTA TGGAAAAACG CCAGCAACGC GGCCTTTTTA CGGTTCCTGG CCTTTTGCTG
6601 GCCTTTTGCT CACATGTTCT TTCCTGCGTT ATCCCCTGAT TCTGTGGATA ACCGTATTAC
6661 CGCCTTTGAG TGAGCTGATA CCGCTCGCCG CAGCCGAACG ACCGAGCGCA GCGAGTCAGT
6721 GAGCGAGGAA GCGGAAGAGC GCCCAATACG CAAACCGCCT CTCCCCGCGC GTTGGCCGAT
6781 TCATTAATGC AGCTGGCACG ACAGGTTTCC CGACTGGAAA GCGGGCAGTG AGCGCAACGC
6841 AATTAATGTG AGTTAGCTCA CTCATTAGGC ACCCCAGGCT TTACACTTTA TGCTTCCGGC
6901 TCGTATGTTG TGTGGAATTG TGAGCGGATA ACAATTTCAC ACAGGAAACA GCTATGACCA
6961 TGATTACGCC AAGCGCGCAA TTAACCCTCA CTAAAGGGAA CAAAAGCTGG GTACCGGGCC
7021 CCCCCT
pDEST12.2 CMV Promoter for Eukaryotic Expression, SV40 Promoter/ori for G418 Resistance
mRNA from CMV promoter
pDEST12.2 7278 bp (rotated to position 3900)

Location (Base Nos.)    Gene Encoded
86..136                 ori
220..742                CMV promoter
1059..935               attR1
1168..1827              CmR
1947..2031              inactivated ccdA
2169..2474              ccdB
2515..2639              attR2
2824..3186              small t & polyA
3310..3378              lac
4363..5157              neo
5680..6540              ampR

1 GGGGGGCGGA GCCTATGGAA AAACGCCAGC AACGCGGCCT TTTTACGGTT CCTGGCCTTT 
61 TGCTGGCCTT TTGCTCACAT GTTCTTTCCT GCGTTATCCC CTGATTCTGT GGATAACCGT 
121 ATTACCGCCT TTGAGTGAGC TGATACCGCT CGCCGCAGCC GAACGACCGA GCGCAGCGAG 
181 TCAGTGAGCG AGGAAGCGGA AGAGCTCGCG AATGCATGTC GTTACATAAC TTACGGTAAA 
241 TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG ACGTCAATAA TGACGTATGT 
301 TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA TGGGTGGAGT ATTTACGGTA 
361 AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA AGTACGCCCC CTATTGACGT 
421 CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC ATGACCTTAT GGGACTTTCC
481 TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC ATGGTGATGC GGTTTTGGCA 
541 GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA TTTCCAAGTC TCCACCCCAT 
601 TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG GACTTTCCAA AATGTCGTAA 
661 CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA CGGTGGGAGG TCTATATAAG
721 CAGAGCTCGT TTAGTGAACC GTCAGATCGC CTGGAGACGC CATCCACGCT GTTTTGACCT 
781 CCATAGAAGA CACCGGGACC GATCCAGCCT CCGGACTCTA GCCTAGGCCG CGGGACGGAT 
841 AACAATTTCA CACAGGAAAC AGCTATGACC ATTAGGCCTT TGCAAAAAGC TATTTAGGTG 
901 ACACTATAGA AGGTACGCCT GCAGGTACCG GATCACAAGT TTGTACAAAA AAGCTGAACG
961 AGAAACGTAA AATGATATAA ATATCAATAT ATTAAATTAG ATTTTGCATA AAAAACAGAC 
1021 TACATAATAC TGTAAAACAC AACATATCCA GTCACTATGG CGGCCGCATT AGGCACCCCA 
1081 GGCTTTACAC TTTATGCTTC CGGCTCGTAT AATGTGTGGA TTTTGAGTTA GGATCCGTCG 
1141 AGATTTTCAG GAGCTAAGGA AGCTAAAATG GAGAAAAAAA TCACTGGATA TACCACCGTT 
1201 GATATATCCC AATGGCATCG TAAAGAACAT TTTGAGGCAT TTCAGTCAGT TGCTCAATGT
1261 ACCTATAACC AGACCGTTCA GCTGGATATT ACGGCCTTTT TAAAGACCGT AAAGAAAAAT 
1321 AAGCACAAGT TTTATCCGGC CTTTATTCAC ATTCTTGCCC GCCTGATGAA TGCTCATCCG 
1381 GAATTCCGTA TGGCAATGAA AGACGGTGAG CTGGTGATAT GGGATAGTGT TCACCCTTGT 
1441 TACACCGTTT TCCATGAGCA AACTGAAACG TTTTCATCGC TCTGGAGTGA ATACCACGAC 
1501 GATTTCCGGC AGTTTCTACA CATATATTCG CAAGATGTGG CGTGTTACGG TGAAAACCTG 
1561 GCCTATTTCC CTAAAGGGTT TATTGAGAAT ATGTTTTTCG TCTCAGCCAA TCCCTGGGTG 
1621 AGTTTCACCA GTTTTGATTT AAACGTGGCC AATATGGACA ACTTCTTCGC CCCCGTTTTC
1681 ACCATGGGCA AATATTATAC GCAAGGCGAC AAGGTGCTGA TGCCGCTGGC GATTCAGGTT 
1741 CATCATGCCG TCTGTGATGG CTTCCATGTC GGCAGAATGC TTAATGAATT ACAACAGTAC 
1801 TGCGATGAGT GGCAGGGCGG GGCGTAAACG CGTGGATCCG GCTTACTAAA AGCCAGATAA 
1861 CAGTATGCGT ATTTGCGCGC TGATTTTTGC GGTATAAGAA TATATACTGA TATGTATACC 
1921 CGAAGTATGT CAAAAAGAGG TGTGCTATGA AGCAGCGTAT TACAGTGACA GTTGACAGCG 
1981 ACAGCTATCA GTTGCTCAAG GCATATATGA TGTCAATATC TCCGGTCTGG TAAGCACAAC 
2041 CATGCAGAAT GAAGCCCGTC GTCTGCGTGC CGAACGCTGG AAAGCGGAAA ATCAGGAAGG 
2101 GATGGCTGAG GTCGCCCGGT TTATTGAAAT GAACGGCTCT TTTGCTGACG AGAACAGGGA 
2161 CTGGTGAAAT GCAGTTTAAG GTTTACACCT ATAAAAGAGA GAGCCGTTAT CGTCTGTTTG 
2221 TGGATGTACA GAGTGATATT ATTGACACGC CCGGGCGACG GATGGTGATC CCCCTGGCCA 
2281 GTGCACGTCT GCTGTCAGAT AAAGTCTCCC GTGAACTTTA CCCGGTGGTG CATATCGGGG 
2341 ATGAAAGCTG GCGCATGATG ACCACCGATA TGGCCAGTGT GCCGGTCTCC GTTATCGGGG 
2401 AAGAAGTGGC TGATCTCAGC CACCGCGAAA ATGACATCAA AAACGCCATT AACCTGATGT 
2461 TCTGGGGAAT ATAAATGTCA GGCTCCCTTA TACACAGCCA GTCTGCAGGT CGACCATAGT 
2521 GACTGGATAT GTTGTGTTTT ACAGTATTAT GTAGTCTGTT TTTTATGCAA AATCTAATTT 
2581 AATATATTGA TATTTATATC ATTTTACGTT TCTCGTTCAG CTTTCTTGTA CAAAGTGGTG 
2641 ATCGCGTGCA TGCGACGTCA TAGCTCTCTC CCTATAGTGA GTCGTATTAT AAGCTAGGCA 
2701 CTGGCCGTCG TTTTACAACG TCGTGACTGG GAAAACTGCT AGCTTGGGAT CTTTGTGAAG 
2761 GAACCTTACT TCTGTGGTGT GACATAATTG GACAAACTAC CTACAGAGAT TTAAAGCTCT 
2821 AAGGTAAATA TAAAATTTTT AAGTGTATAA TGTGTTAAAC TAGCTGCATA TGCTTGCTGC 
2881 TTGAGAGTTT TGCTTACTGA GTATGATTTA TGAAAATATT ATACACAGGA GCTAGTGATT 
2941 CTAATTGTTT GTGTATTTTA GATTCACAGT CCCAAGGCTC ATTTCAGGCC CCTCAGTCCT 
3001 CACAGTCTGT TCATGATCAT AATCAGCCAT ACCACATTTG TAGAGGTTTT ACTTGCTTTA 
3061 AAAAACCTCC CACACCTCCC CCTGAACCTG AAACATAAAA TGAATGCAAT TGTTGTTGTT 
3121 AACTTGTTTA TTGCAGCTTA TAATGGTTAC AAATAAAGCA ATAGCATCAC AAATTTCACA 
3181 AATAAAGCAT TTTTTTCACT GCATTCTAGT TGTGGTTTGT CCAAACTCAT CAATGTATCT
3241 TATCATGTCT GGATCGATCC TGCATTAATG AATCGGCCAA CGCGCGGGGA GAGGCGGTTT 
3301 GCGTATTGGC TGGCGTAATA GCGAAGAGGC CCGCACCGAT CGCCCTTCCC AACAGTTGCG 
3361 CAGCCTGAAT GGCGAATGGG ACGCGCCCTG TAGCGGCGCA TTAAGCGCGG CGGGTGTGGT 
3421 GGTTACGCGC AGCGTGACCG CTACACTTGC CAGCGCCCTA GCGCCCGCTC CTTTCGCTTT 
3481 CTTCCCTTCC TTTCTCGCCA CGTTCGCCGG CTTTCCCCGT CAAGCTCTAA ATCGGGGGCT 
3541 CCCTTTAGGG TTCCGATTTA GTGCTTTACG GCACCTCGAC CCCAAAAAAC TTGATTAGGG 
3601 TGATGGTTCA CGTAGTGGGC CATCGCCCTG ATAGACGGTT TTTCGCCCTT TGACGTTGGA 
3661 GTCCACGTTC TTTAATAGTG GACTCTTGTT CCAAACTGGA ACAACACTCA ACCCTATCTC 
3721 GGTCTATTCT TTTGATTTAT AAGGGATTTT GCCGATTTCG GCCTATTGGT TAAAAAATGA 
3781 GCTGATTTAA CAAATATTTA ACGCGAATTT TAACAAAATA TTAACGTTTA CAATTTCGCC 
3841 TGATGCGGTA TTTTCTCCTT ACGCATCTGT GCGGTATTTC ACACCGCATA CGCGGATCTG 
3901 CGCAGCACCA TGGCCTGAAA TAACCTCTGA AAGAGGAACT TGGTTAGGTA CCTTCTGAGG
3961 CGGAAAGAAC CAGCTGTGGA ATGTGTGTCA GTTAGGGTGT GGAAAGTCCC CAGGCTCCCC 
4021 AGCAGGCAGA AGTATGCAAA GCATGCATCT CAATTAGTCA GCAACCAGGT GTGGAAAGTC 
4081 CCCAGGCTCC CCAGCAGGCA GAAGTATGCA
4141 AGTCCCGCCC CTAACTCCGC CCATCCCGCC 
4201 GCCCCATGGC TGACTAATTT TTTTTATTTA 
4261 GCTATTCCAG AAGTAGTGAG GAGGCTTTTT 
4321 TTCTTCTGAC ACAACAGTCT CGAACTTAAG 
4381 TTGCACGCAG GTTCTCCGGC CGCTTGGGTG 
4441 CAGACAATCG GCTGCTCTGA TGCCGCCGTG 
4501 CTTTTTGTCA AGACCGACCT GTCCGGTGCC 
4561 CTATCGTGGC TGGCCACGAC GGGCGTTCCT 
4621 GCGGGAAGGG ACTGGCTGCT ATTGGGCGAA 
4681 CTTGCTCCTG CCGAGAAAGT ATCCATCATG 
4741 GATCCGGCTA CCTGCCCATT CGACCACCAA 
4801 CGGATGGAAG CCGGTCTTGT CGATCAGGAT
4861 CCAGCCGAAC TGTTCGCCAG GCTCAAGGCG 
4921 ACCCATGGCG ATGCCTGCTT GCCGAATATC 
4981 ATCGACTGTG GCCGGCTGGG TGTGGCGGAC 
5041 GATATTGCTG AAGAGCTTGG CGGCGAATGG 
5101 GCCGCTCCCG ATTCGCAGCG CATCGCCTTC 
5161 GGACTCTGGG GTTCGAAATG ACCGACCAAG 
5221 AATAAAATAT CTTTATTTTC ATTACATCTG 
5281 ATAAGGATCC GCGTATGGTG CACTCTCAGT 
5341 CAGCCCCGAC ACCCGCCAAC ACCCGCTGAC 
5401 TCCGCTTACA GACAAGCTGT GACCGTCTCC 
5461 TCATCACCGA AACGCGCGAG ACGAAAGGGC 
5521 GTCATGATAA TAATGGTTTC TTAGACGTCA 
5581 ACCCCTATTT GTTTATTTTT CTAAATACAT 
5641 CCCTGATAAA TGCTTCAATA ATATTGAAAA 
5701 GTCGCCCTTA TTCCCTTTTT TGCGGCATTT 
5761 CTGGTGAAAG TAAAAGATGC TGAAGATCAG 
5821 GATCTCAACA GCGGTAAGAT CCTTGAGAGT 
5881 AGCACTTTTA AAGTTCTGCT ATGTGGCGCG 
5941 CAACTCGGTC GCCGCATACA CTATTCTCAG 
6001 GAAAAGCATC TTACGGATGG CATGACAGTA 
6061 AGTGATAACA CTGCGGCCAA CTTACTTCTG 
6121 GCI I I I I IGC ACAACATGGG GGATCATGTA 
6181 AATGAAGCCA TACCAAACGA CGAGCGTGAC 
6241 TTGCGCAAAC TATTAACTGG CGAACTACTT 
6301 TGGATGGAGG CGGATAAAGT TGCAGGACCA 
6361 TTTATTGCTG ATAAATCTGG AGCCGGTGAG 
6421 GGGCCAGATG GTAAGCCCTC CCGTATCGTA 
6481 ATGGATGAAC GAAATAGACA GATCGCTGAG 

fig.: 



AAGCATGCAT CTCAATTAGT CAGCAACCAT 
CCTAACTCCG CCCAGTTCCG CCCATTCTCC 
TGCAGAGGCC GAGGCCGCCT CGGCCTCTGA 
TGGAGGCCTA GGCTTTTGCA AAAAGCTTGA 
GCTAGAGCCA CCATGATTGA ACAAGATGGA 
GAGAGGCTAT TCGGCTATGA CTGGGCACAA 
TTCCGGCTGT CAGCGCAGGG GCGCCCGGTT 
CTGAATGAAC TGCAGGACGA GGCAGCGCGG 
TGCGCAGCTG TGCTCGACGT TGTCACTGAA 
GTGCCGGGGC AGGATCTCCT GTCATCTCAC 
GCTGATGCAA TGCGGCGGCT GCATACGCTT 
GCGAAACATC GCATCGAGCG AGCACGTACT 
GATCTGGACG AAGAGCATCA GGGGCTCGCG 
CGCATGCCCG ACGGCGAGGA TCTCGTCGTG 
ATGGTGGAAA ATGGCCGCTT TTCTGGATTC 
CGCTATCAGG ACATAGCGTT GGCTACCCGT 
GCTGACCGCT TCCTCGTGCT TTACGGTATC 
TATCGCCTTC TTGACGAGTT CTTCTGAGCG 
CGACGCCCAA CCTGCCATCA CGATGGCCGC 
TGTGTTGGTT TTTTGTGTGA ATCGATAGCG 
ACAATCTGCT CTGATGCCGC ATAGTTAAGC 
GCGCCCTGAC GGGCTTGTCT GCTCCCGGCA 
GGGAGCTGCA TGTGTCAGAG GTTTTCACCG 
CTCGTGATAC GCCTATTTTT ATAGGTTAAT 
GGTGGCACTT TTCGGGGAAA TGTGCGCGGA 
TCAAATATGT ATCCGCTCAT GAGACAATAA 
AGGAAGAGTA TGAGTATTCA ACATTTCCGT 
TGCCTTCCTG TTTTTGCTCA CCCAGAAACG 
TTGGGTGCAC GAGTGGGTTA CATCGAACTG 
TTTCGCCCCG AAGAACGTTT TCCAATGATG 
GTATTATCCC GTATTGACGC CGGGCAAGAG 
AATGACTTGG TTGAGTACTC ACCAGTCACA 
AGAGAATTAT GCAGTGCTGC CATAACCATG 
ACAACGATCG GAGGACCGAA GGAGCTAACC 
ACTCGCCTTG ATCGTTGGGA ACCGGAGCTG 
ACCACGATGC CTGTAGCAAT GGCAACAACG 
ACTCTAGCTT CCCGGCAACA ATTAATAGAC 
CTTCTGCGCT CGGCCCTTCC GGCTGGCTGG 
CGTGGGTCTC GCGGTATCAT TGCAGCACTG 
GTTATCTACA CGACGGGGAG TCAGGCAACT 
ATAGGTGCCT CACTGATTAA GCATTGGTAA 
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6541 CTGTCAGACC AAGTTTACTC ATATATACTT TAGATTGATT TAAAACTTCA TTTTTAATTT 
6601 AAAAGGATCT AGGTGAAGAT CCTTTTTGAT AATCTCATGA CCAAAATCCC TTAACGTGAG 
6661 TTTTCGTTCC ACTGAGCGTC AGACCCCGTA GAAAAGATCA AAGGATCTTC TTGAGATCCT
6721 TTTTTTCTGC GCGTAATCTG CTGCTTGCAA ACAAAAAAAC CACCGCTACC AGCGGTGGTT
6781 TGTTTGCCGG ATCAAGAGCT ACCAACTCTT TTTCCGAAGG TAACTGGCTT CAGCAGAGCG 
6841 CAGATACCAA ATACTGTCCT TCTAGTGTAG CCGTAGTTAG GCCACCACTT CAAGAACTCT 
6901 GTAGCACCGC CTACATACCT CGCTCTGCTA ATCCTGTTAC CAGTGGCTGC TGCCAGTGGC 
6961 GATAAGTCGT GTCTTACCGG GTTGGACTCA AGACGATAGT TACCGGATAA GGCGCAGCGG 
7021 TCGGGCTGAA CGGGGGGTTC GTGCACACAG CCCAGCTTGG AGCGAACGAC CTACACCGAA
7081 CTGAGATACC TACAGCGTGA GCATTGAGAA AGCGCCACGC TTCCCGAAGG GAGAAAGGCG 
7141 GACAGGTATC CGGTAAGCGG CAGGGTCGGA ACAGGAGAGC GCACGAGGGA GCTTCCAGGG 
7201 GGAAACGCCT GGTATCTTTA TAGTCCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA
7261 TTTTTGTGAT GCTCGTCA
Native protein in E. coli: λPL promoter

[Figure: double-stranded sequence detail of pDEST13, bases 3721 to 4020, annotated with the BglII site, the -35 and -10 regions of the λPL promoter, the mRNA start, the EcoNI site, and the Int cleavage site within attR1.]

pDEST13 5848 bp

Location (Base Nos.)    Gene Encoded
599..1458               ampR
4123..3998              attR1
4372..5031              CmR
5151..5235              inactivated ccdA
5373..5578              ccdB
5719..5843              attR2
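
The feature table gives each element as a pair of base numbers, and one range (attR1, 4123..3998) runs from a higher to a lower number. A minimal Python sketch of reading such a range off a sequence string follows. It assumes 1-based inclusive coordinates and assumes that a descending range denotes the complementary strand; neither convention is stated in the source, and `extract_feature` and `reverse_complement` are hypothetical helper names.

```python
# Sketch only: the coordinate conventions below are assumptions.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_feature(plasmid: str, start: int, end: int) -> str:
    """Return the bases of a feature given 1-based inclusive coordinates.

    If start > end, the range is read on the complementary strand
    (an assumed convention) and returned as a reverse complement.
    """
    if min(start, end) < 1 or max(start, end) > len(plasmid):
        raise ValueError("coordinates fall outside the sequence")
    if start <= end:
        return plasmid[start - 1:end]
    return reverse_complement(plasmid[end - 1:start])


toy = "AACCGGTTAC"
forward = extract_feature(toy, 1, 4)  # bases 1..4 as written
reverse = extract_feature(toy, 4, 1)  # same bases, other strand
```

Under these assumptions the attR1 entry would span 126 bases (4123 - 3998 + 1).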



1 TTCACTGGCC GTCGTTTTAC AACGTCGTGA CTGGGAAAAC CCTGGCGTTA CCCAACTTAA 
61 TCGCCTTGCA GCACATCCCC CTTTCGCCAG CTGGCGTAAT AGCGAAGAGG CCCGCACCGA 
121 TCGCCCTTCC CAACAGTTGC GCAGCCTGAA TGGCGAATGG CGCCTGATGC GGTATTTTCT 
181 CCTTACGCAT CTGTGCGGTA TTTCACACCG CATATGGTGC ACTCTCAGTA CAATCTGCTC 
241 TGATGCCGCA TAGTTAAGCC AGCCCCGACA CCCGCCAACA CCCGCTGACG CGCCCTGACG 
301 GGCTTGTCTG CTCCCGGCAT CCGCTTACAG ACAAGCTGTG ACCGTCTCCG GGAGCTGCAT 
361 GTGTCAGAGG TTTTCACCGT CATCACCGAA ACGCGCGAGA CGAAAGGGCC TCGTGATACG
421 CCTATTTTTA TAGGTTAATG TCATGATAAT AATGGTTTCT TAGACGTCAG GTGGCACTTT 
481 TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA 
541 TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA GGAAGAGTAT 
601 GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT
661 TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG 
721 AGTGGGTTAC ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA 
781 AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG 
841 TATTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT 
901 TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG 
961 CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA CAACGATCGG
1021 AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA CTCGCCTTGA
1081 TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGCGTGACA CCACGATGCC 
1141 TGTAGCAATG GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC 
1201 CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC 
1261 GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC GTGGGTCTCG
1321 CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG TTATCTACAC 
1381 GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA TAGGTGCCTC 
1441 ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT 
1501 AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC 
1561 CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA
1621 AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC 
1681 ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT 
1741 AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTTCTT CTAGTGTAGC CGTAGTTAGG 
1801 CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC 
1861 AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT
1921 ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA
1981 GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CATTGAGAAA GCGCCACGCT 
2041 TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG 
2101 CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA 
2161 CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC TATGGAAAAA 
2221 CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT 
2281 CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA 
2341 TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA 
2401 GCGCCCAATA CGCAAACCGC CTCTCCCCGC GCGTTGGCCG ATTCATTAAT GCAGCTGGCA 
2461 CGACAGGTTT CCCGACTGGA AAGCGGGCAG TGAGCGCAAC GCAATTAATG TGAGTTAGCT 
2521 CACTCATTAG GCACCCCAGG CTTTACACTT TATGCTTCCG GCTCGTATGT TGTGTGGAAT 
2581 TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATGAC CATGATTACG CCAAGCTTGG 
2641 CTGCAGGTGA TGATTATCAG CCAGCAGAGA TTAAGGAAAA CAGACAGGTT TATTGAGCGC
2701 TTATCTTTCC CTTTATTTTT GCTGCGGTAA GTCGCATAAA AACCATTCTT CATAATTCAA 
2761 TCCATTTACT ATGTTATGTT CTGAGGGGAG TGAAAATTCC CCTAATTCGA TGAAGATTCT 
2821 TGCTCAATTG TTATCAGCTA TGCGCCGACC AGAACACCTT GCCGATCAGC CAAACGTCTC 
2881 TTCAGGCCAC TGACTAGCGA TAACTTTCCC CACAACGGAA CAACTCTCAT TGCATGGGAT 
2941 CATTGGGTAC TGTGGGTTTA GTGGTTGTAA AAACACCTGA CCGCTATCCC TGATCAGTTT 
3001 CTTGAAGGTA AACTCATCAC CCCCAAGTCT GGCTATGCAG AAATCACCTG GCTCAACAGC 
3061 CTGCTCAGGG TCAACGAGAA TTAACATTCC GTCAGGAAAG CTTGGCTTGG AGCCTGTTGG 
3121 TGCGGTCATG GAATTACCTT CAACCTCAAG CCAGAATGCA GAATCACTGG CTTTTTTGGT
3181 TGTGCTTACC CATCTCTCCG CATCACCTTT GGTAAAGGTT CTAAGCTTAG GTGAGAACAT 
3241 CCCTGCCTGA ACATGAGAAA AAACAGGGTA CTCATACTCA CTTCTAAGTG ACGGCTGCAT 
3301 ACTAACCGCT TCATACATCT CGTAGATTTC TCTGGCGATT GAAGGGCTAA ATTCTTCAAC
3361 GCTAACTTTG AGAATTTTTG CAAGCAATGC GGCGTTATAA GCATTTAATG CATTGATGCC 
3421 ATTAAATAAA GCACCAACGC CTGACTGCCC CATCCCCATC TTGTCTGCGA CAGATTCCTG 
3481 GGATAAGCCA AGTTCATTTT TCTTTTTTTC ATAAATTGCT TTAAGGCGAC GTGCGTCCTC
3541 AAGCTGCTCT TGTGTTAATG GTTTCTTTTT TGTGCTCATA CGTTAAATCT ATCACCGCAA 
3601 GGGATAAATA TCTAACACCG TGCGTGTTGA CTATTTTACC TCTGGCGGTG ATAATGGTTG 
3661 CATGTACTAA GGAGGTTGTA TGGAACAACG CATAACCCTG AAAGATTATG CAATGCGCTT 
3721 TGGGCAAACC AAGACAGCTA AAGATCTCTC ACCTACCAAA CAATGCCCCC CTGCAAAAAA 
3781 TAAATTCATA TAAAAAACAT ACAGATAACC ATCTGCGGTG ATAAATTATC TCTGGCGGTG 
3841 TTGACATAAA TACCACTGGC GGTGATACTG AGCACATCAG CAGGACGCAC TGACCACCAT 
3901 GAAGGTGACG CTCTTAAAAA TTAAGCCCTG AAGAAGGGCA GCATTCAAAG CAGAAGGCTT 
3961 TGGGGTGTGT GATACGAAAC GAAGCATTGG GATCATCACA AGTTTGTACA AAAAAGCTGA 
4021 ACGAGAAACG TAAAATGATA TAAATATCAA TATATTAAAT TAGATTTTGC ATAAAAAACA 
4081 GACTACATAA TACTGTAAAA CACAACATAT CCAGTCACTA TGGCGGCCGC TAAGTTGGCA 
4141 GCATCACCCG ACGCACTTTG CGCCGAATAA ATACCTGTGA CGGAAGATCA CTTCGCAGAA 
4201 TAAATAAATC CTGGTGTCCC TGTTGATACC GGGAAGCCCT GGGCCAACTT TTGGCGAAAA 
4261 TGAGACGTTG ATCGGCACGT AAGAGGTTCC AACTTTCACC ATAATGAAAT AAGATCACTA
4321 CCGGGCGTAT TTTTTGAGTT ATCGAGATTT TCAGGAGCTA AGGAAGCTAA AATGGAGAAA
4381 AAAATCACTG GATATACCAC CGTTGATATA TCCCAATGGC ATCGTAAAGA ACATTTTGAG
4441 GCATTTCAGT CAGTTGCTCA ATGTACCTAT AACCAGACCG TTCAGCTGGA TATTACGGCC 
4501 TTTTTAAAGA CCGTAAAGAA AAATAAGCAC AAGTTTTATC CGGCCTTTAT TCACATTCTT 
4561 GCCCGCCTGA TGAATGCTCA TCCGGAATTC CGTATGGCAA TGAAAGACGG TGAGCTGGTG 
4621 ATATGGGATA GTGTTCACCC TTGTTACACC GTTTTCCATG AGCAAACTGA AACGTTTTCA 
4681 TCGCTCTGGA GTGAATACCA CGACGATTTC CGGCAGTTTC TACACATATA TTCGCAAGAT 
4741 GTGGCGTGTT ACGGTGAAAA CCTGGCCTAT TTCCCTAAAG GGTTTATTGA GAATATGTTT 
4801 TTCGTCTCAG CCAATCCCTG GGTGAGTTTC ACCAGTTTTG ATTTAAACGT GGCCAATATG 
4861 GACAACTTCT TCGCCCCCGT TTTCACCATG GGCAAATATT ATACGCAAGG CGACAAGGTG 
4921 CTGATGCCGC TGGCGATTCA GGTTCATCAT GCCGTCTGTG ATGGCTTCCA TGTCGGCAGA 
4981 ATGCTTAATG AATTACAACA GTACTGCGAT GAGTGGCAGG GCGGGGCGTA AACGCGTGGA 
5041 TCCGGCTTAC TAAAAGCCAG ATAACAGTAT GCGTATTTGC GCGCTGATTT TTGCGGTATA 
5101 AGAATATATA CTGATATGTA TACCCGAAGT ATGTCAAAAA GAGGTGTGCT ATGAAGCAGC 
5161 GTATTACAGT GACAGTTGAC AGCGACAGCT ATCAGTTGCT CAAGGCATAT ATGATGTCAA 
5221 TATCTCCGGT CTGGTAAGCA CAACCATGCA GAATGAAGCC CGTCGTCTGC GTGCCGAACG 
5281 CTGGAAAGCG GAAAATCAGG AAGGGATGGC TGAGGTCGCC CGGTTTATTG AAATGAACGG 
5341 CTCTTTTGCT GACGAGAACA GGGACTGGTG AAATGCAGTT TAAGGTTTAC ACCTATAAAA 
5401 GAGAGAGCCG TTATCGTCTG TTTGTGGATG TACAGAGTGA TATTATTGAC ACGCCCGGGC 
5461 GACGGATGGT GATCCCCCTG GCCAGTGCAC GTCTGCTGTC AGATAAAGTC TCCCGTGAAC 
5521 TTTACCCGGT GGTGCATATC GGGGATGAAA GCTGGCGCAT GATGACCACC GATATGGCCA 
5581 GTGTGCCGGT CTCCGTTATC GGGGAAGAAG TGGCTGATCT CAGCCACCGC GAAAATGACA 
5641 TCAAAAACGC CATTAACCTG ATGTTCTGGG GAATATAAAT GTCAGGCTCC GTTATACACA 
5701 GCCAGTCTGC AGGTCGACCA TAGTGACTGG ATATGTTGTG TTTTACAGTA TTATGTAGTC 
5761 TGTTTTTTAT GCAAAATCTA ATTTAATATA TTGATATTTA TATCATTTTA CGTTTCTCGT
5821 TCAGCTTTCT TGTACAAAGT GGTGATAA 
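
Each line of the listing starts with the number of its first base, followed by blocks of ten bases, so the listing can be checked mechanically: the last line starts at 5821 and carries 28 bases, which agrees with the stated size of 5848 bp. The following Python sketch shows one way to read such a listing and confirm that the line numbers and base counts agree. `parse_listing` is a hypothetical helper, and the sample text is the first two lines of the pDEST13 listing.

```python
def parse_listing(text: str) -> str:
    """Concatenate a numbered, space-blocked sequence listing.

    Each sequence line is expected to begin with the 1-based number of
    its first base. Lines that do not begin with a number are skipped.
    """
    sequence = ""
    for line in text.splitlines():
        tokens = line.split()
        if not tokens or not tokens[0].isdigit():
            continue  # blank line or non-sequence text
        expected = len(sequence) + 1
        if int(tokens[0]) != expected:
            raise ValueError(
                f"line starts at {tokens[0]}, expected {expected}"
            )
        chunk = "".join(tokens[1:]).upper()
        if set(chunk) - set("ACGT"):
            raise ValueError(f"unexpected character on line {tokens[0]}")
        sequence += chunk
    return sequence


sample = """
1 TTCACTGGCC GTCGTTTTAC AACGTCGTGA CTGGGAAAAC CCTGGCGTTA CCCAACTTAA
61 TCGCCTTGCA GCACATCCCC CTTTCGCCAG CTGGCGTAAT AGCGAAGAGG CCCGCACCGA
"""
parsed = parse_listing(sample)
```

A line whose number does not match the running base count, or that contains a character other than A, C, G or T, raises an error, which makes this kind of check useful for spotting transcription damage in a listing.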
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Native Protein Expression in E. coli, T7 Promoter 



[Figure: double-stranded sequence detail of pDEST14, beginning at base 3961 of the map numbering, annotated with the BglII and AseI sites, the T7 promoter (pT7), the mRNA start, the XbaI site, and the Int cleavage site within attR1.]

[Figure: restriction map of pDEST14. Sites legible on the map, with positions as printed: SgrA I 6381, Sph I 6230, EcoN I 6165, Xba I 59, Bsg I 5138, Tth111 I 4568, Sap I 4435, Mlu I 1101, BamH I 1107, BssH II 1148, Srf I 1522, Sma I 1523, BstX I 1635, Sal I 1782, Bpu1102 I 1967, Hind III 2396, Eam1105 I 3420.]

pDEST14 6422 bp (rotated to position 4000)

Location (Base Nos.)    Gene Encoded
185..61                 attR1
435..1094               CmR
1214..1298              inactivated ccdA
1436..1741              ccdB
1782..1906              attR2
2632..3489              ampR
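
The heading describes the pDEST14 listing as rotated. Because a plasmid is circular, the same molecule can be written starting from any base, and every feature coordinate shifts accordingly. The Python sketch below illustrates that bookkeeping under the assumption that "rotated" refers to such a change of origin; the source does not define the term, and `rotate_circular` and `remap_position` are hypothetical helper names.

```python
def rotate_circular(seq: str, new_origin: int) -> str:
    """Rewrite a circular sequence so that 1-based base `new_origin`
    becomes base 1."""
    if not 1 <= new_origin <= len(seq):
        raise ValueError("origin falls outside the sequence")
    cut = new_origin - 1
    return seq[cut:] + seq[:cut]


def remap_position(position: int, new_origin: int, length: int) -> int:
    """Return the 1-based coordinate of `position` after the rotation."""
    return (position - new_origin) % length + 1


toy = "ABCDEFGH"
rotated = rotate_circular(toy, 3)           # starts at the old base 3
new_pos = remap_position(1, 3, len(toy))    # where the old base 1 lands
```

Rotation changes only the numbering, so a base keeps its identity: `rotated[new_pos - 1]` is the same letter as `toy[0]`.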



1 CGATCCCGCG AAATTAATAC GACTCACTAT AGGGAGACCA CAACGGTTTC CCTCTAGATC 
61 ACAAGTTTGT ACAAAAAAGC TGAACGAGAA ACGTAAAATG ATATAAATAT CAATATATTA 
121 AATTAGATTT TGCATAAAAA ACAGACTACA TAATACTGTA AAACACAACA TATCCAGTCA 
181 CTATGGCGGC CGCTAAGTTG GCAGCATCAC CCGACGCACT TTGCGCCGAA TAAATACCTG 
241 TGACGGAAGA TCACTTCGCA GAATAAATAA ATCCTGGTGT CCCTGTTGAT ACCGGGAAGC
301 CCTGGGCCAA CTTTTGGCGA AAATGAGACG TTGATCGGCA CGTAAGAGGT TCCAACTTTC 
361 ACCATAATGA AATAAGATCA CTACCGGGCG TATTTTTTGA GTTATCGAGA TTTTCAGGAG
421 CTAAGGAAGC TAAAATGGAG AAAAAAATCA CTGGATATAC CAGCGTTGAT ATATCCCAAT 
481 GGCATCGTAA AGAACATTTT GAGGCATTTC AGTCAGTTGC TCAATGTACC TATAACCAGA 
541 CCGTTCAGCT GGATATTACG GCCTTTTTAA AGACCGTAAA GAAAAATAAG CACAAGTTTT 
601 ATCCGGCCTT TATTCACATT CTTGCCCGCC TGATGAATGC TCATCCGGAA TTCCGTATGG 
661 CAATGAAAGA CGGTGAGCTG GTGATATGGG ATAGTGTTCA CCCTTGTTAC ACCGTTTTCC 
721 ATGAGCAAAC TGAAACGTTT TCATCGCTCT GGAGTGAATA CCACGACGAT TTCCGGCAGT 
781 TTCTACACAT ATATTCGCAA GATGTGGCGT GTTACGGTGA AAACCTGGCC TATTTCCCTA
841 AAGGGTTTAT TGAGAATATG TTTTTCGTGT CAGCCAATCC CTGGGTGAGT TTCACCAGTT 
901 TTGATTTAAA CGTGGCCAAT ATGGACAACT TCTTCGCCCC CGTTTTCACC ATGGGCAAAT 
961 ATTATACGCA AGGCGACAAG GTGCTGATGC CGCTGGCGAT TCAGGTTCAT CATGCCGTCT 
1021 GTGATGGCTT CCATGTCGGC AGAATGCTTA ATGAATTACA ACAGTACTGC GATGAGTGGC 
1081 AGGGCGGGGC GTAAACGCGT GGATCCGGCT TACTAAAAGC CAGATAACAG TATGCGTATT
1141 TGCGCGCTGA TTTTTGCGGT ATAAGAATAT ATACTGATAT GTATACCCGA AGTATGTCAA 
1201 AAAGAGGTGT GCTATGAAGC AGCGTATTAC AGTGACAGTT GACAGCGACA GCTATCAGTT 
1261 GCTCAAGGCA TATATGATGT CAATATCTCC GGTCTGGTAA GCACAACCAT GCAGAATGAA 
1321 GCCCGTCGTC TGCGTGCCGA ACGCTGGAAA GCGGAAAATC AGGAAGGGAT GGCTGAGGTC 
1381 GCCCGGTTTA TTGAAATGAA CGGCTCTTTT GCTGACGAGA ACAGGGACTG GTGAAATGCA 
1441 GTTTAAGGTT TACACCTATA AAAGAGAGAG CCGTTATCGT CTGTTTGTGG ATGTACAGAG 
1501 TGATATTATT GACACGCCCG GGCGACGGAT GGTGATCCCC CTGGCCAGTG CACGTCTGCT 
1561 GTCAGATAAA GTCTCCCGTG AACTTTACCC GGTGGTGCAT ATCGGGGATG AAAGCTGGCG 
1621 CATGATGACC ACCGATATGG CCAGTGTGCC GGTCTCCGTT ATCGGGGAAG AAGTGGCTGA
1681 TCTCAGCCAC CGCGAAAATG ACATCAAAAA CGCCATTAAC CTGATGTTCT GGGGAATATA 
1741 AATGTCAGGC TCCCTTATAC ACAGCCAGTC TGCAGGTCGA CCATAGTGAC TGGATATGTT
1801 GTGTTTTACA GTATTATGTA GTCTGTTTTT TATGCAAAAT CTAATTTAAT ATATTGATAT 
1861 TTATATCATT TTACGTTTCT CGTTCAGCTT TCTTGTACAA AGTGGTGATG ATCCGGCTGC 
1921 TAACAAAGCC CGAAAGGAAG CTGAGTTGGC TGCTGCCACC GCTGAGCAAT AACTAGCATA
1981 ACCCCTTGGG GCCTCTAAAC GGGTCTTGAG GGGTTTTTTG CTGAAAGGAG GAACTATATC
2041 CGGATATCCA CAGGACGGGT GTGGTCGCCA TGATCGCGTA GTCGATAGTG GCTCCAAGTA
2101 GCGAAGCGAG CAGGACTGGG CGGCGGCCAA AGCGGTCGGA CAGTGCTCCG AGAACGGGTG
2161 CGCATAGAAA TTGCATCAAC GCATATAGCG CTAGCAGCAC GCCATAGTGA CTGGCGATGC
2221 TGTCGGAATG GACGATATCC CGCAAGAGGC CCGGCAGTAC CGGCATAACC AAGCCTATGC
2281 CTACAGCATC CAGGGTGACG GTGCCGAGGA TGACGATGAG CGCATTGTTA GATTTCATAC
2341 ACGGTGCCTG ACTGCGTTAG CAATTTAACT GTGATAAACT ACCGCATTAA AGCTTATCGA
2401 TGATAAGCTG TCAAACATGA GAATTCTTGA AGACGAAAGG GCCTCGTGAT ACGCCTATTT
2461 TTATAGGTTA ATGTCATGAT AATAATGGTT TCTTAGACGT CAGGTGGCAC TTTTCGGGGA
2521 AATGTGCGCG GAACCCCTAT TTGTTTATTT TTCTAAATAC ATTCAAATAT GTATCCGCTC
2581 ATGAGACAAT AACCCTGATA AATGCTTCAA TAATATTGAA AAAGGAAGAG TATGAGTATT
2641 CAACATTTCC GTGTCGCCCT TATTCCCTTT TTTGCGGCAT TTTGCCTTCC TGTTTTTGCT
2701 CACCCAGAAA CGCTGGTGAA AGTAAAAGAT GCTGAAGATC AGTTGGGTGC ACGAGTGGGT
2761 TACATCGAAC TGGATCTCAA CAGCGGTAAG ATCCTTGAGA GTTTTCGCCC CGAAGAACGT
2821 TTTCCAATGA TGAGCACTTT TAAAGTTCTG CTATGTGGCG CGGTATTATC CCGTGTTGAC
2881 GCCGGGCAAG AGCAACTCGG TCGCCGCATA CACTATTCTC AGAATGACTT GGTTGAGTAC
2941 TCACCAGTCA CAGAAAAGCA TCTTACGGAT GGCATGACAG TAAGAGAATT ATGCAGTGCT
3001 GCCATAACCA TGAGTGATAA CACTGCGGCC AACTTACTTC TGACAACGAT CGGAGGACCG
3061 AAGGAGCTAA CCGCTTTTTT GCACAACATG GGGGATCATG TAACTCGCCT TGATCGTTGG
3121 GAACCGGAGC TGAATGAAGC CATACCAAAC GACGAGCGTG ACACCACGAT GCCTGCAGCA
3181 ATGGCAACAA CGTTGCGCAA ACTATTAACT GGCGAACTAC TTACTCTAGC TTCCCGGCAA
3241 CAATTAATAG ACTGGATGGA GGCGGATAAA GTTGCAGGAC CACTTCTGCG CTCGGCCCTT
3301 CCGGCTGGCT GGTTTATTGC TGATAAATCT GGAGCCGGTG AGCGTGGGTC TCGCGGTATC
3361 ATTGCAGCAC TGGGGCCAGA TGGTAAGCCC TCCCGTATCG TAGTTATCTA CACGACGGGG
3421 AGTCAGGCAA CTATGGATGA ACGAAATAGA CAGATCGCTG AGATAGGTGC CTCACTGATT
3481 AAGCATTGGT AACTGTCAGA CCAAGTTTAC TCATATATAC TTTAGATTGA TTTAAAACTT
3541 CATTTTTAAT TTAAAAGGAT CTAGGTGAAG ATCCTTTTTG ATAATCTCAT GACCAAAATC
3601 CCTTAACGTG AGTTTTCGTT CCACTGAGCG TCAGACCCCG TAGAAAAGAT CAAAGGATCT
3661 TCTTGAGATC CTTTTTTTCT GCGCGTAATC TGCTGCTTGC AAACAAAAAA ACCACCGCTA
3721 CCAGCGGTGG TTTGTTTGCC GGATCAAGAG CTACCAACTC TTTTTCCGAA GGTAACTGGC
3781 TTCAGCAGAG CGCAGATACC AAATACTGTC CTTCTAGTGT AGCCGTAGTT AGGCCACCAC
3841 TTCAAGAACT CTGTAGCACC GCCTACATAC CTCGCTCTGC TAATCCTGTT ACCAGTGGCT
3901 GCTGCCAGTG GCGATAAGTC GTGTCTTACC GGGTTGGACT CAAGACGATA GTTACCGGAT
3961 AAGGCGCAGC GGTCGGGCTG AACGGGGGGT TCGTGCACAC AGCCCAGCTT GGAGCGAACG
4021 ACCTACACCG AACTGAGATA CCTACAGCGT GAGCTATGAG AAAGCGCCAC GCTTCCCGAA
4081 GGGAGAAAGG CGGACAGGTA TCCGGTAAGC GGCAGGGTCG GAACAGGAGA GCGCACGAGG
4141 GAGCTTCCAG GGGGAAACGC CTGGTATCTT TATAGTCCTG TCGGGTTTCG CCACCTCTGA
4201 CTTGAGCGTC GATTTTTGTG ATGCTCGTCA GGGGGGCGGA GCCTATGGAA AAACGCCAGC
4261 AACGCGGCCT TTTTACGGTT CCTGGCCTTT TGCTGGCCTT TTGCTCACAT GTTCTTTCCT
4321 GCGTTATCCC CTGATTCTGT GGATAACCGT ATTACCGCCT TTGAGTGAGC TGATACCGCT
4381 CGCCGCAGCC GAACGACCGA GCGCAGCGAG TCAGTGAGCG AGGAAGCGGA AGAGCGCCTG
4441 ATGCGGTATT TTCTCCTTAC GCATCTGTGC GGTATTTCAC ACCGCATATA TGGTGCACTC
4501 TCAGTACAAT CTGCTCTGAT GCCGCATAGT TAAGCCAGTA TACACTCCGC TATCGCTACG
4561 TGACTGGGTC ATGGCTGCGC CCCGACACCC GCCAACACCC GCTGACGCGC CCTGACGGGC
4621 TTGTCTGCTC CCGGCATCCG CTTACAGACA AGCTGTGACC GTCTCCGGGA GCTGCATGTG
4681 TCAGAGGTTT TCACCGTCAT CACCGAAACG CGCGAGGCAG CTGCGGTAAA GCTCATCAGC
4741 GTGGTCGTGA AGCGATTCAC AGATGTCTGC CTGTTCATCC GCGTCCAGCT CGTTGAGTTT
4801 CTCCAGAAGC GTTAATGTCT GGCTTCTGAT AAAGCGGGCC ATGTTAAGGG CGGTTTTTTC
4861 CTGTTTGGTC ACTGATGCCT CCGTGTAAGG GGGATTTCTG TTCATGGGGG TAATGATACC
4921 GATGAAACGA GAGAGGATGC TCACGATACG GGTTACTGAT GATGAACATG CCCGGTTACT
4981 GGAACGTTGT GAGGGTAAAC AACTGGCGGT ATGGATGCGG CGGGACCAGA GAAAAATCAC
5041 TCAGGGTCAA TGCCAGCGCT TCGTTAATAC AGATGTAGGT GTTCCACAGG GTAGCCAGCA
5101 GCATCCTGCG ATGCAGATCC GGAACATAAT GGTGCAGGGC GCTGACTTCC GCGTTTCCAG
5161 ACTTTACGAA ACACGGAAAC CGAAGACCAT TCATGTTGTT GCTCAGGTCG CAGACGTTTT
5221 GCAGCAGCAG TCGCTTCACG TTCGCTCGCG TATCGGTGAT TCATTCTGCT AACCAGTAAG
5281 GCAACCCCGC CAGCCTAGCC GGGTCCTCAA CGACAGGAGC ACGATCATGC GCACCCGTGG
5341 CCAGGACCCA ACGCTGCCCG AGATGCGCCG CGTGCGGCTG CTGGAGATGG CGGACGCGAT
5401 GGATATGTTC TGCCAAGGGT TGGTTTGCGC ATTCACAGTT CTCCGCAAGA ATTGATTGGC
5461 TCCAATTCTT GGAGTGGTGA ATCCGTTAGC GAGGTGCCGC CGGCTTCCAT TCAGGTCGAG
5521 GTGGCCCGGC TCCATGCACC GCGACGCAAC GCGGGGAGGC AGACAAGGTA TAGGGCGGCG
5581 CCTACAATCC ATGCCAACCC GTTCCATGTG CTCGCCGAGG CGGCATAAAT CGCCGTGACG
5641 ATCAGCGGTC CAGTGATCGA AGTTAGGCTG GTAAGAGCCG CGAGCGATCC TTGAAGCTGT
5701 CCCTGATGGT CGTCATCTAC CTGCCTGGAC AGCATGGCCT GCAACGCGGG CATCCCGATG
5761 CCGCCGGAAG CGAGAAGAAT CATAATGGGG AAGGCCATCC AGCCTCGCGT CGCGAACGCC
5821 AGCAAGACGT AGCCCAGCGC GTCGGCCGCC ATGCCGGCGA TAATGGCCTG CTTCTCGCCG
5881 AAACGTTTGG TGGCGGGACC AGTGACGAAG GCTTGAGCGA GGGCGTGCAA GATTCCGAAT
5941 ACCGCAAGCG ACAGGCCGAT CATCGTCGCG CTCCAGCGAA AGCGGTCCTC GCCGAAAATG
6001 ACCCAGAGCG CTGCCGGCAC CTGTCCTACG AGTTGCATGA TAAAGAAGAC AGTCATAAGT
6061 GCGGCGACGA TAGTCATGCC CCGCGCCCAC CGGAAGGAGC TGACTGGGTT GAAGGCTCTC
6121 AAGGGCATCG GTCGATCGAC GCTCTCCCTT ATGCGACTCC TGCATTAGGA AGCAGCCCAG
6181 TAGTAGGTTG AGGCCGTTGA GCACCGCCGC CGCAAGGAAT GGTGCATGCA AGGAGATGGC
6241 GCCCAACAGT CCCCCGGCCA CGGGGCCTGC CACCATACCC ACGCCGAAAC AAGCGCTCAT
6301 GAGCCCGAAG TGGCGAGCCC GATCTTCCCC ATCGGTGATG TCGGCGATAT AGGCGCCAGC
6361 AACCGCACCT GTGGCGCCGG TGATGCCGGC CACGATGCGT CCGGCGTAGA GGATCGAGAT
6421 CT
pDEST15 Glutathione-S-transferase Fusion in E. coli, T7 Promoter



[Figure: double-stranded sequence detail of pDEST15 (rows beginning at bases 1, 52, 103, 766 and 817), annotated with the T7 promoter, the mRNA start, the XbaI and NdeI sites, the start of translation of GST, the translated junction residues (S N Q T S L Y K K A), the attR1 site, and the Int cleavage site.]

[Figure: restriction map of pDEST15. Sites legible on the map, with positions as printed: SgrA I 6976, Sph I 6825, PshA I 6667, Nru I 6411, Tth111 I 5163, BsaA I 5158, Bgl II 5, Bcl I 539, BssH II 1732, Srf I 2106, BstX I 2219, Sal I 2366, Bpu1102 I 2562, Hind III 2991, Cla I 2997, Aat II 3097, Eam1105 I 4015.]

pDEST15 7013 bp

Location (Base Nos.)    Gene Encoded
108..776                GST
916..792                attR1
1025..1537              CmR
1804..1888              inactivated ccdA
2026..2331              ccdB
2372..2496              attR2
3233..4093              ampR

1 ATCGAGATCT CGATCCCGCG AAATTAATAC GACTCACTAT AGGGAGACCA CAACGGTTTC 
61 CCTCTAGAAA TAATTTTGTT TAACTTTAAG AAGGAGATAT ACATATGTCC CCTATACTAG 
121 GTTATTGGAA AATTAAGGGC CTTGTGCAAC CCACTCGACT TCTTTTGGAA TATCTTGAAG 
181 AAAAATATGA AGAGCATTTG TATGAGCGCG ATGAAGGTGA TAAATGGCGA AACAAAAAGT 
241 TTGAATTGGG TTTGGAGTTT CCCAATCTTC CTTATTATAT TGATGGTGAT GTTAAATTAA 
301 CACAGTCTAT GGCCATCATA CGTTATATAG CTGACAAGCA CAACATGTTG GGTGGTTGTC 
361 CAAAAGAGCG TGCAGAGATT TCAATGCTTG AAGGAGCGGT TTTGGATATT AGATACGGTG 
421 TTTCGAGAAT TGCATATAGT AAAGACTTTG AAACTCTCAA AGTTGATTTT CTTAGCAAGC 
481 TACCTGAAAT GCTGAAAATG TTCGAAGATC GTTTATGTCA TAAAACATAT TTAAATGGTG 
541 ATCATGTAAC CCATCCTGAC TTCATGTTGT ATGACGCTCT TGATGTTGTT TTATACATGG 
601 ACCCAATGTG CCTGGATGCG TTCCCAAAAT TAGTTTGTTT TAAAAAACGT ATTGAAGCTA 
661 TCCCACAAAT TGATAAGTAC TTGAAATCCA GCAAGTATAT AGCATGGCCT TTGCAGGGCT 
721 GGCAAGCCAC GTTTGGTGGT GGCGACCATC CTCCAAAATC GGATCTGGTT CCGCGTCCAT 
781 GGTCGAATCA AACAAGTTTG TACAAAAAAG CTGAACGAGA AACGTAAAAT GATATAAATA 
841 TCAATATATT AAATTAGATT TTGCATAAAA AACAGACTAC ATAATACTGT AAAACACAAC 
901 ATATCCAGTC ACTATGGCGG CCGCATTAGG CACCCCAGGC TTTACACTTT ATGCTTCCGG 
961 CTCGTATAAT GTGTGGATTT TGAGTTAGGA TCCGTCGAGA TTTTCAGGAG CTAAGGAAGC 
1021 TAAAATGGAG AAAAAAATCA CTGGATATAC CACCGTTGAT ATATCCCAAT GGCATCGTAA
1081 AGAACATTTT GAGGCATTTC AGTCAGTTGC TCAATGTACC TATAACCAGA CCGTTCAGCT
1141 GGATATTACG GCCTTTTTAA AGACCGTAAA GAAAAATAAG CACAAGTTTT ATCCGGCCTT 
1201 TATTCACATT CTTGCCCGCC TGATGAATGC TCATCCGGAA TTCCGTATGG CAATGAAAGA 
1261 CGGTGAGCTG GTGATATGGG ATAGTGTTCA CCCTTGTTAC ACCGTTTTCC ATGAGCAAAC 
1321 TGAAACGTTT TCATCGCTCT GGAGTGAATA CCACGACGAT TTCCGGCAGT TTCTACACAT 
1381 ATATTCGCAA GATGTGGCGT GTTACGGTGA AAACCTGGCC TATTTCCCTA AAGGGTTTAT
1441 TGAGAATATG TTTTTCGTCT CAGCCAATCC CTGGGTGAGT TTCACCAGTT TTGATTTAAA 
1501 CGTGGCCAAT ATGGACAACT TCTTCGCCCC CGTTTTCACC ATGGGCAAAT ATTATACGCA 
1561 AGGCGACAAG GTGCTGATGC CGCTGGCGAT TCAGGTTCAT CATGCCGTCT GTGATGGCTT 
1621 CCATGTCGGC AGAATGCTTA ATGAATTACA ACAGTACTGC GATGAGTGGC AGGGCGGGGC 
1681 GTAATCTAGA GGATCCGGCT TACTAAAAGC CAGATAACAG TATGCGTATT TGCGCGCTGA
1741 TTTTTGCGGT ATAAGAATAT ATACTGATAT GTATACCCGA AGTATGTCAA AAAGAGGTGT 
1801 GCTATGAAGC AGCGTATTAC AGTGACAGTT GACAGCGACA GCTATCAGTT GCTCAAGGCA 



FIG.35B



1861 TATATGATGT CAATATCTCC GGTCTGGTAA GCACAACCAT GCAGAATGAA GCCCGTCGTC 
1921 TGCGTGCCGA ACGCTGGAAA GCGGAAAATC AGGAAGGGAT GGCTGAGGTC GCCCGGTTTA 
1981 TTGAAATGAA CGGCTCTTTT GCTGACGAGA ACAGGGACTG GTGAAATGCA GTTTAAGGTT 
2041 TACACCTATA AAAGAGAGAG CCGTTATCGT CTGTTTGTGG ATGTACAGAG TGATATTATT 
2101 GACACGCCCG GGCGACGGAT GGTGATCCCC CTGGCCAGTG CACGTCTGCT GTCAGATAAA 
2161 GTCTCCCGTG AACTTTACCC GGTGGTGCAT ATCGGGGATG AAAGCTGGCG CATGATGACC 
2221 ACCGATATGG CCAGTGTGCC GGTCTCCGTT ATCGGGGAAG AAGTGGCTGA TCTCAGCCAC 
2281 CGCGAAAATG ACATCAAAAA CGCCATTAAC CTGATGTTCT GGGGAATATA AATGTCAGGC 
2341 TCCCTTATAC ACAGCCAGTC TGCAGGTCGA CCATAGTGAC TGGATATGTT GTGTTTTACA
2401 GTATTATGTA GTCTGTTTTT TATGCAAAAT CTAATTTAAT ATATTGATAT TTATATCATT 
2461 TTACGTTTCT CGTTCAGCTT TCTTGTACAA AGTGGTTTGA TTCGACCCGG GATCCGGCTG 
2521 CTAACAAAGC CCGAAAGGAA GCTGAGTTGG CTGCTGCCAC CGCTGAGCAA TAACTAGCAT 
2581 AACCCCTTGG GGCCTCTAAA CGGGTCTTGA GGGGTTTTTT GCTGAAAGGA GGAACTATAT
2641 CCGGATATCC ACAGGACGGG TGTGGTCGCC ATGATCGCGT AGTCGATAGT GGCTCCAAGT 
2701 AGCGAAGCGA GCAGGACTGG GCGGCGGCCA AAGCGGTCGG ACAGTGCTCC GAGAACGGGT 
2761 GCGCATAGAA ATTGCATCAA CGCATATAGC GCTAGCAGCA CGCCATAGTG ACTGGCGATG 
2821 CTGTCGGAAT GGACGATATC CCGCAAGAGG CCCGGCAGTA CCGGCATAAC CAAGCCTATG 
2881 CCTACAGCAT CCAGGGTGAC GGTGCCGAGG ATGACGATGA GCGCATTGTT AGATTTCATA 
2941 CACGGTGCCT GACTGCGTTA GCAATTTAAC TGTGATAAAC TACCGCATTA AAGCTTATCG 
3001 ATGATAAGCT GTCAAACATG AGAATTCTTG AAGACGAAAG GGCCTCGTGA TACGCCTATT 
3061 TTTATAGGTT AATGTCATGA TAATAATGGT TTCTTAGACG TCAGGTGGCA CTTTTCGGGG 
3121 AAATGTGCGC GGAACCCCTA TTTGTTTATT TTTCTAAATA CATTCAAATA TGTATCCGCT 
3181 CATGAGACAA TAACCCTGAT AAATGCTTCA ATAATATTGA AAAAGGAAGA GTATGAGTAT 
3241 TCAACATTTC CGTGTCGCCC TTATTCCCTT TTTTGCGGCA TTTTGCCTTC CTGTTTTTGC 
3301 TCACCCAGAA ACGCTGGTGA AAGTAAAAGA TGCTGAAGAT CAGTTGGGTG CACGAGTGGG 
3361 TTACATCGAA CTGGATCTCA ACAGCGGTAA GATCCTTGAG AGTTTTCGCC CCGAAGAACG
3421 TTTTCCAATG ATGAGCACTT TTAAAGTTGT GCTATGTGGC GCGGTATTAT CCCGTGTTGA 
3481 CGCCGGGCAA GAGCAACTCG GTCGCCGCAT ACACTATTCT CAGAATGACT TGGTTGAGTA 
3541 CTCACCAGTC ACAGAAAAGC ATCTTACGGA TGGCATGACA GTAAGAGAAT TATGCAGTGC 
3601 TGCCATAACC ATGAGTGATA ACACTGCGGC CAACTTACTT CTGACAACGA TCGGAGGACC
3661 GAAGGAGCTA ACCGCTTTTT TGCACAACAT GGGGGATCAT GTAACTCGCC TTGATCGTTG 
3721 GGAACCGGAG CTGAATGAAG CCATACCAAA CGACGAGCGT GACACCACGA TGCCTGCAGC 
3781 AATGGCAACA ACGTTGCGCA AACTATTAAC TGGCGAACTA CTTACTCTAG CTTCCCGGCA 
3841 ACAATTAATA GACTGGATGG AGGCGGATAA AGTTGCAGGA CCACTTCTGC GCTCGGCCCT 
3901 TCCGGCTGGC TGGTTTATTG CTGATAAATC TGGAGCCGGT GAGCGTGGGT CTCGCGGTAT 
3961 CATTGCAGCA CTGGGGCCAG ATGGTAAGCC CTCCCGTATC GTAGTTATCT ACACGACGGG 
4021 GAGTCAGGCA ACTATGGATG AACGAAATAG ACAGATCGCT GAGATAGGTG CCTCACTGAT 
4081 TAAGCATTGG TAACTGTCAG ACCAAGTTTA CTCATATATA CTTTAGATTG ATTTAAAACT
4141 TCATTTTTAA TTTAAAAGGA TCTAGGTGAA GATCCTTTTT GATAATCTCA TGACCAAAAT 
4201 CCCTTAACGT GAGTTTTCGT TCCACTGAGC GTCAGACCCC GTAGAAAAGA TCAAAGGATC
4261 TTCTTGAGAT CCTTTTTTTC TGCGCGTAAT CTGCTGCTTG CAAACAAAAA AACCACCGCT
4321 ACCAGCGGTG GTTTGTTTGC CGGATCAAGA GCTACCAACT CTTTTTCCGA AGGTAACTGG
4381 CTTCAGCAGA GCGCAGATAC CAAATACTGT CCTTCTAGTG TAGCCGTAGT TAGGCCACCA
4441 CTTCAAGAAC TCTGTAGCAC CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC
4501 TGCTGCCAGT GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT AGTTACCGGA
4561 TAAGGCGCAG CGGTCGGGCT GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC
4621 GACCTACACC GAACTGAGAT ACCTACAGCG TGAGCTATGA GAAAGCGCCA CGCTTCCCGA
4681 AGGGAGAAAG GCGGACAGGT ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG
4741 GGAGCTTCCA GGGGGAAACG CCTGGTATCT TTATAGTCCT GTCGGGTTTC GCCACCTCTG
4801 ACTTGAGCGT CGATTTTTGT GATGCTCGTC AGGGGGGCGG AGCCTATGGA AAAACGCCAG
4861 CAACGCGGCC TTTTTACGGT TCCTGGCCTT TTGCTGGCCT TTTGCTCACA TGTTCTTTCC
4921 TGCGTTATCC CCTGATTCTG TGGATAACCG TATTACCGCC TTTGAGTGAG CTGATACCGC
4981 TCGCCGCAGC CGAACGACCG AGCGCAGCGA GTCAGTGAGC GAGGAAGCGG AAGAGCGCCT
5041 GATGCGGTAT TTTCTCCTTA CGCATCTGTG CGGTATTTCA CACCGCATAT ATGGTGCACT
5101 CTCAGTACAA TCTGCTCTGA TGCCGCATAG TTAAGCCAGT ATACACTCCG CTATCGCTAC
5161 GTGACTGGGT CATGGCTGCG CCCCGACACC CGCCAACACC CGCTGACGCG CCCTGACGGG
5221 CTTGTCTGCT CCCGGCATCC GCTTACAGAC AAGCTGTGAC CGTCTCCGGG AGCTGCATGT
5281 GTCAGAGGTT TTCACCGTCA TCACCGAAAC GCGCGAGGCA GCTGCGGTAA AGCTCATCAG
5341 CGTGGTCGTG AAGCGATTCA CAGATGTCTG CCTGTTCATC CGCGTCCAGC TCGTTGAGTT
5401 TCTCCAGAAG CGTTAATGTC TGGCTTCTGA TAAAGCGGGC CATGTTAAGG GCGGTTTTTT
5461 CCTGTTTGGT CACTGATGCC TCCGTGTAAG GGGGATTTCT GTTCATGGGG GTAATGATAC
5521 CGATGAAACG AGAGAGGATG CTCACGATAC GGGTTACTGA TGATGAACAT GCCCGGTTAC
5581 TGGAACGTTG TGAGGGTAAA CAACTGGCGG TATGGATGCG GCGGGACCAG AGAAAAATCA
5641 CTCAGGGTCA ATGCCAGCGC TTCGTTAATA CAGATGTAGG TGTTCCACAG GGTAGCCAGC
5701 AGCATCCTGC GATGCAGATC CGGAACATAA TGGTGCAGGG CGCTGACTTC CGCGTTTCCA
5761 GACTTTACGA AACACGGAAA CCGAAGACCA TTCATGTTGT TGCTCAGGTC GCAGACGTTT
5821 TGCAGCAGCA GTCGCTTCAC GTTCGCTCGC GTATCGGTGA TTCATTCTGC TAACCAGTAA
5881 GGCAACCCCG CCAGCCTAGC CGGGTCCTCA ACGACAGGAG CACGATCATG CGCACCCGTG
5941 GCCAGGACCC AACGCTGCCC GAGATGCGCC GCGTGCGGCT GCTGGAGATG GCGGACGCGA
6001 TGGATATGTT CTGCCAAGGG TTGGTTTGCG CATTCACAGT TCTCCGCAAG AATTGATTGG
6061 CTCCAATTCT TGGAGTGGTG AATCCGTTAG CGAGGTGCCG CCGGCTTCCA TTCAGGTCGA
6121 GGTGGCCCGG CTCCATGCAC CGCGACGCAA CGCGGGGAGG CAGACAAGGT ATAGGGCGGC
6181 GCCTACAATC GATGCCAACC CGTTCCATGT GCTCGCCGAG GCGGCATAAA TCGCCGTGAC
6241 GATCAGCGGT CCAGTGATCG AAGTTAGGCT GGTAAGAGCC GCGAGCGATC CTTGAAGCTG
6301 TCCCTGATGG TCGTCATCTA CCTGCCTGGA CAGCATGGCC TGCAACGCGG GCATCCCGAT
6361 GCCGCCGGAA GCGAGAAGAA TCATAATGGG GAAGGCCATC CAGCCTCGCG TCGCGAACGC
6421 CAGCAAGACG TAGCCCAGCG CGTCGGCCGC CATGCCGGCG ATAATGGCCT GCTTCTCGCC
6481 GAAACGTTTG GTGGCGGGAC CAGTGACGAA GGCTTGAGCG AGGGCGTGCA AGATTCCGAA
6541 TACCGCAAGC GACAGGCCGA TCATCGTCGC GCTCCAGCGA AAGCGGTCCT CGCCGAAAAT
6601 GACCCAGAGC GCTGCCGGCA CCTGTCCTAC GAGTTGCATG ATAAAGAAGA CAGTCATAAG
6661 TGCGGCGACG ATAGTCATGC CCCGCGCCCA CCGGAAGGAG CTGACTGGGT TGAAGGCTCT
6721 CAAGGGCATC GGTCGATCGA CGCTCTCCCT TATGCGACTC CTGCATTAGG AAGCAGCCCA
6781 GTAGTAGGTT GAGGCCGTTG AGCACCGCCG CCGCAAGGAA TGGTGCATGC AAGGAGATGG
6841 CGCCCAACAG TCCCCCGGCC ACGGGGCCTG CCACCATACC CACGCCGAAA CAAGCGCTCA
6901 TGAGCCCGAA GTGGCGAGCC CGATCTTCCC CATCGGTGAT GTCGGCGATA TAGGCGCCAG
6961 CAACCGCACC TGTGGCGCCG GTGATGCCGG CCACGATGCG TCCGGCGTAG AGG
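
The restriction maps label each site with a base number, and those labels can be compared against the listing by searching for the enzyme's recognition sequence. For instance, line 2941 of the pDEST15 listing ends with AAGCTTATCG, which places AAGCTT at base 2991 and agrees with the Hind III 2991 label on the map. The Python sketch below searches a circular sequence for a recognition site, including sites that span the junction between the last and first base. `find_sites` is a hypothetical helper, the toy sequences are invented for illustration, and the sketch handles only unambiguous recognition sequences on the strand as written.

```python
from typing import List


def find_sites(plasmid: str, site: str) -> List[int]:
    """Return 1-based start positions of `site` in a circular sequence."""
    plasmid = plasmid.upper()
    site = site.upper()
    n = len(plasmid)
    # Append the first len(site) - 1 bases so that matches spanning the
    # end-to-start junction of the circle are found as well.
    extended = plasmid + plasmid[:len(site) - 1]
    positions = []
    start = extended.find(site)
    while start != -1 and start < n:
        positions.append(start + 1)
        start = extended.find(site, start + 1)
    return positions


linear_hit = find_sites("TTCCCAAGCTTGGGAATTCGAA", "AAGCTT")
junction_hit = find_sites("GCTTCCCCAA", "AAGCTT")
```

In the second toy sequence the site begins at base 9 and continues across the junction into bases 1 to 4, which a plain linear search would miss.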
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Thioredoxin N-Fusion Protein in E. coli with T7 Promoter 

[Figure: double-stranded sequence detail of pDEST16 (rows beginning at bases 1 and 460), annotated with the T7 promoter, the mRNA start, the XbaI site, the translated junction residues (T S L Y K K A), the attR1 site, and the Int cleavage site.]

[Figure: restriction map of pDEST16. Sites legible on the map: Nru I, BstAP I, EcoN I, PshA I, Bsg I, Tth111 I, BsaA I, Sap I, BspLU11 I, Nco I, Mlu I, BssH II, BbvC I, Srf I, Sma I, BstX I, Sal I, Bpu1102 I, Nhe I, Hind III, Ahd I, Aat II.]

FIG.36A

pDEST16 6675 bp

Location (Base Nos.)    Gene Encoded
104..457                trxA
585..461                attR1
694..1353               CmR
1473..1557              inactivated ccdA
1695..2000              ccdB
2041..2165              attR2


1 AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG AGACCACAAC GGTTTCCCTC 
61 TAGAAATAAT TTTGTTTAAC TTTAAGAAGG AGATATACAT ATGAGCGATA AAATTATTCA 
121 CCTGACTGAC GACAGTTTTG ACACGGATGT ACTCAAAGCG GACGGGGCGA TCCTCGTCGA 
181 TTTCTGGGCA GAGTGGTGCG GTCCGTGCAA AATGATCGCC CCGATTCTGG ATGAAATCGC 
241 TGACGAATAT CAGGGCAAAC TGACCGTTGC AAAACTGAAC ATCGATCAAA ACCCTGGCAC 
301 TGCGCCGAAA TATGGCATCC GTGGTATCCC GACTCTGCTG CTGTTCAAAA ACGGTGAAGT 
361 GGCGGCAACC AAAGTGGGTG CACTGTCTAA AGGTCAGTTG AAAGAGTTCC TCGACGCTAA 
421 CCTGGCCGGT TCTGGTTCTG GTGATGACGA TGACAAGATC ACAAGTTTGT ACAAAAAAGC 
481 TGAACGAGAA ACGTAAAATG ATATAAATAT CAATATATTA AATTAGATTT TGCATAAAAA 
541 ACAGACTACA TAATACTGTA AAACACAACA TATCCAGTCA CTATGGCGGC CGCATTAGGC 
601 ACCCCAGGCT TTACACTTTA TGCTTCCGGC TCGTATAATG TGTGGATTTT GAGTTAGGAT 
661 CCGGCGAGAT TTTCAGGAGC TAAGGAAGCT AAAATGGAGA AAAAAATCAC TGGATATACC 
721 ACCGTTGATA TATCCCAATG GCATCGTAAA GAACATTTTG AGGCATTTCA GTCAGTTGCT 
781 CAATGTACCT ATAACCAGAC CGTTCAGCTG GATATTACGG CCTTTTTAAA GACCGTAAAG 
841 AAAAATAAGC ACAAGTTTTA TCCGGCCTTT ATTCACATTC TTGCCCGCCT GATGAATGCT 
901 CATCCGGAAT TCCGTATGGC AATGAAAGAC GGTGAGCTGG TGATATGGGA TAGTGTTCAC 
961 CCTTGTTACA CCGTTTTCCA TGAGCAAACT GAAACGTTTT CATCGCTCTG GAGTGAATAC 
1021 CACGACGATT TCCGGCAGTT TCTACACATA TATTCGCAAG ATGTGGCGTG TTACGGTGAA 
1081 AACCTGGGCT ATTTCCCTAA AGGGTTTATT GAGAATATGT TTTTCGTCTC AGCCAATCCC 
1141 TGGGTGAGTT TCACCAGTTT TGATTTAAAC GTGGCCAATA TGGACAACTT CTTCGCCCCC
1201 GTTTTCACCA TGGGCAAATA TTATACGCAA GGCGACAAGG TGCTGATGCC GCTGGCGATT 
1261 CAGGTTCATC ATGCCGTCTG TGATGGCTTC CATGTCGGCA GAATGCTTAA TGAATTACAA 
1321 CAGTACTGCG ATGAGTGGCA GGGCGGGGCG TAAACGCGTG GATCCGGCTT ACTAAAAGCC 
1381 AGATAACAGT ATGCGTATTT GCGCGCTGAT TTTTGCGGTA TAAGAATATA TACTGATATG 
1441 TATACCCGAA GTATGTCAAA AAGAGGTGTG CTATGAAGCA GCGTATTACA GTGACAGTTG 
1501 ACAGCGACAG CTATCAGTTG CTCAAGGCAT ATATGATGTC AATATCTCCG GTCTGGTAAG 
1561 CACAACCATG CAGAATGAAG CCCGTCGTCT GCGTGCCGAA CGCTGGAAAG CGGAAAATCA 
1621 GGAAGGGATG GCTGAGGTCG CCCGGTTTAT TGAAATGAAC GGCTCTTTTG CTGACGAGAA 
1681 CAGGGACTGG TGAAATGCAG TTTAAGGTTT ACACCTATAA AAGAGAGAGC CGTTATCGTC 
1741 TGTTTGTGGA TGTACAGAGT GATATTATTG ACACGCCCGG GCGACGGATG GTGATCCCCC 
1801 TGGCCAGTGC ACGTCTGCTG TCAGATAAAG TCTCCCGTGA ACTTTACCCG GTGGTGCATA 
1861 TCGGGGATGA AAGCTGGCGC ATGATGACCA CCGATATGGC CAGTGTGCCG GTCTCCGTTA 
1921 TCGGGGAAGA AGTGGCTGAT CTCAGCCACC GCGAAAATGA CATCAAAAAC GCCATTAACC 
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1981 TGATGTTCTG GGGAATATAA ATGTCAGGCT CCCTTATACA CAGCCAGTCT GCAGGTCGAC 
2041 CATAGTGACT GGATATGTTG TGTTTTACAG TATTATGTAG TCTGTTTTTT ATGCAAAATC
2101 TAATTTAATA TATTGATATT TATATCATTT TACGTTTCTC GTTCAGCTTT CTTGTACAAA 
2161 GTGGTGATGA TCCGGCTGCT AACAAAGCCC GAAAGGAAGC TGAGTTGGCT GCTGCCACCG 
2221 CTGAGCAATA ACTAGCATAA CCCCTTGGGG CCTCTAAACG GGTCTTGAGG GGTTTTTTGC
2281 TGAAAGGAGG AACTATATCC GGATATCCAC AGGACGGGTG TGGTCGCCAT GATCGCGTAG 
2341 TCGATAGTGG CTCCAAGTAG CGAAGCGAGC AGGACTGGGC GGCGGCCAAA GCGGTCGGAC 
2401 AGTGCTCCGA GAACGGGTGC GCATAGAAAT TGCATCAACG CATATAGCGC TAGCAGCACG 
2461 CCATAGTGAC TGGCGATGCT GTCGGAATGG ACGATATCCC GCAAGAGGCC CGGCAGTACC 
2521 GGCATAACCA AGCCTATGCC TACAGCATCC AGGGTGACGG TGCCGAGGAT GACGATGAGC 
2581 GCATTGTTAG ATTTCATACA CGGTGCCTGA CTGCGTTAGC AATTTAACTG TGATAAACTA 
2641 CCGCATTAAA GCTTATCGAT GATAAGCTGT CAAACATGAG AATTCTTGAA GACGAAAGGG 
2701 CCTCGTGATA CGCCTATTTT TATAGGTTAA TGTCATGATA ATAATGGTTT CTTAGACGTC 
2761 AGGTGGCACT TTTCGGGGAA ATGTGCGCGG AACCCCTATT TGTTTATTTT TCTAAATACA 
2821 TTCAAATATG TATCCGCTCA TGAGACAATA ACCCTGATAA ATGCTTCAAT AATATTGAAA 
2881 AAGGAAGAGT ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 
2941 TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG CTGAAGATCA 
3001 GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC AGCGGTAAGA TCCTTGAGAG
3061 TTTTCGCCCC GAAGAACGTT TTCCAATGAT GAGCACTTTT AAAGTTCTGC TATGTGGCGC 
3121 GGTATTATCC CGTGTTGACG CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA 
3181 GAATGACTTG GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 
3241 AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA ACTTACTTCT 
3301 GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG CACAACATGG GGGATCATGT
3361 AACTCGCCTT GATCGTTGGG AACCGGAGCT GAATGAAGCC ATACCAAACG ACGAGCGTGA 
3421 CACCACGATG CCTGCAGCAA TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT 
3481 TACTCTAGCT TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 
3541 ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG GAGCCGGTGA 
3601 GCGTGGGTCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT GGTAAGCCCT CCCGTATCGT 
3661 AGTTATCTAC ACGACGGGGA GTCAGGCAAC TATGGATGAA CGAAATAGAC AGATCGCTGA 
3721 GATAGGTGCC TCACTGATTA AGCATTGGTA ACTGTCAGAC CAAGTTTACT CATATATACT 
3781 TTAGATTGAT TTAAAACTTC ATTTTTAATT TAAAAGGATC TAGGTGAAGA TCCTTTTTGA 
3841 TAATCTCATG ACCAAAATCC CTTAACGTGA GTTTTCGTTC CACTGAGCGT CAGACCCCGT 
3901 AGAAAAGATC AAAGGATCTT CTTGAGATCC TTTTTTTCTG CGCGTAATCT GCTGCTTGCA
3961 AACAAAAAAA CCACCGCTAC CAGCGGTGGT TTGTTTGCCG GATCAAGAGC TACCAACTCT
4021 TTTTCCGAAG GTAACTGGCT TCAGCAGAGC GCAGATACCA AATACTGTCC TTCTAGTGTA 
4081 GCCGTAGTTA GGCCACCACT TCAAGAACTC TGTAGCACCG CCTACATACC TCGCTCTGCT 
4141 AATCCTGTTA CCAGTGGCTG CTGCCAGTGG CGATAAGTCG TGTCTTACCG GGTTGGACTC 
4201 AAGACGATAG TTACCGGATA AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT CGTGCACACA 
4261 GCCCAGCTTG GAGCGAACGA CCTACACCGA ACTGAGATAC CTACAGCGTG AGCTATGAGA 
4321 AAGCGCCACG CTTCCCGAAG GGAGAAAGGC GGACAGGTAT CCGGTAAGCG GCAGGGTCGG 
4381 AACAGGAGAG CGCACGAGGG AGCTTCCAGG GGGAAACGCC TGGTATCTTT ATAGTCCTGT 
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4441 CGGGTTTCGC CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG 
4501 CCTATGGAAA AACGCCAGCA ACGCGGCCTT TTTACGGTTC CTGGCCTTTT GCTGGCCTTT 
4561 TGCTCACATG TTCTTTCCTG CGTTATCCCC TGATTCTGTG GATAACCGTA TTACCGCCTT 
4621 TGAGTGAGCT GATACCGCTC GCCGCAGCCG AACGACCGAG CGCAGCGAGT CAGTGAGCGA 
4681 GGAAGCGGAA GAGCGCCTGA TGCGGTATTT TCTCCTTACG CATCTGTGCG GTATTTCACA 
4741 CCGCATATAT GGTGCACTCT CAGTACAATC TGCTCTGATG CCGCATAGTT AAGCCAGTAT 
4801 ACACTCCGCT ATCGCTACGT GACTGGGTCA TGGCTGCGCC CCGACACCCG CCAACACCCG 
4861 CTGACGCGCC CTGACGGGCT TGTCTGCTCC CGGCATCCGC TTACAGACAA GCTGTGACCG 
4921 TCTCCGGGAG CTGCATGTGT CAGAGGTTTT CACCGTCATC ACCGAAACGC GCGAGGCAGC 
4981 TGCGGTAAAG CTCATCAGCG TGGTCGTGAA GCGATTCACA GATGTCTGCC TGTTCATCCG 
5041 CGTCCAGCTC GTTGAGTTTC TCCAGAAGCG TTAATGTCTG GCTTCTGATA AAGCGGGCCA 
5101 TGTTAAGGGC GGTTTTTTCC TGTTTGGTCA CTGATGCCTC CGTGTAAGGG GGATTTCTGT
5161 TCATGGGGGT AATGATACCG ATGAAACGAG AGAGGATGCT CACGATACGG GTTACTGATG 
5221 ATGAACATGC CCGGTTACTG GAACGTTGTG AGGGTAAACA ACTGGCGGTA TGGATGCGGC 
5281 GGGACCAGAG AAAAATCACT CAGGGTCAAT GCCAGCGCTT CGTTAATACA GATGTAGGTG 
5341 TTCCACAGGG TAGCCAGCAG CATCCTGCGA TGCAGATCCG GAACATAATG GTGCAGGGCG 
5401 CTGACTTCCG CGTTTCCAGA CTTTACGAAA CACGGAAACC GAAGACCATT CATGTTGTTG 
5461 CTCAGGTCGC AGACGTTTTG CAGCAGCAGT CGCTTCACGT TCGCTCGCGT ATCGGTGATT 
5521 CATTCTGCTA ACCAGTAAGG CAACCCCGCC AGCCTAGCCG GGTCCTCAAC GACAGGAGCA 
5581 CGATCATGCG CACCCGTGGC CAGGACCCAA CGCTGCCCGA GATGCGCCGC GTGCGGCTGC 
5641 TGGAGATGGC GGACGCGATG GATATGTTCT GCCAAGGGTT GGTTTGCGCA TTCACAGTTC 
5701 TCCGCAAGAA TTGATTGGCT CCAATTCTTG GAGTGGTGAA TCCGTTAGCG AGGTGCCGCC 
5761 GGCTTCCATT CAGGTCGAGG TGGCCCGGCT CCATGCACCG CGACGCAACG CGGGGAGGCA 
5821 GACAAGGTAT AGGGCGGCGC CTACAATCCA TGCCAACCCG TTCCATGTGC TCGCCGAGGC 
5881 GGCATAAATC GCCGTGACGA TCAGCGGTCC AGTGATCGAA GTTAGGCTGG TAAGAGCCGC 
5941 GAGCGATCCT TGAAGCTGTC CCTGATGGTC GTCATCTACC TGCCTGGACA GCATGGCCTG 
6001 CAACGCGGGC ATCCCGATGC CGCCGGAAGC GAGAAGAATC ATAATGGGGA AGGCCATCCA 
6061 GCCTCGCGTC GCGAACGCCA GCAAGACGTA GCCCAGCGCG TCGGCCGCCA TGCCGGCGAT 
6121 AATGGCCTGC TTCTCGCCGA AACGTTTGGT GGCGGGACCA GTGACGAAGG CTTGAGCGAG
6181 GGCGTGCAAG ATTCCGAATA CCGCAAGCGA CAGGCCGATC ATCGTCGCGC TCCAGCGAAA 
6241 GCGGTCCTCG CCGAAAATGA CCCAGAGCGC TGCCGGCACC TGTCCTACGA GTTGCATGAT 
6301 AAAGAAGACA GTCATAAGTG CGGCGACGAT AGTCATGCCC CGCGCCCACC GGAAGGAGCT 
6361 GACTGGGTTG AAGGCTCTCA AGGGCATCGG TCGATCGACG CTCTCCCTTA TGCGACTCCT 
6421 GCATTAGGAA GCAGCCCAGT AGTAGGTTGA GGCCGTTGAG CACCGCCGCC GCAAGGAATG 
6481 GTGCATGCAA GGAGATGGCG CCCAACAGTC CCCCGGCCAC GGGGCCTGCC ACCATACCCA 
6541 CGCCGAAACA AGCGCTCATG AGCCCGAAGT GGCGAGCCCG ATCTTCCCCA TCGGTGATGT 
6601 CGGCGATATA GGCGCCAGCA ACCGCACCTG TGGCGCCGGT GATGCCGGCC ACGATGCGTC 
6661 CGGCGTAGAG GATCG 
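
The listing above numbers the first base of each row and groups the bases in blocks of ten, so the final row (offset 6661, 15 bases) accounts for the stated 6675 bp. A minimal Python sketch for reading such rows back into one string follows; the names `parse_numbered_listing` and `listing_length` are illustrative and do not come from the source.

```python
import re


def parse_numbered_listing(rows):
    """Join rows like '61 TAGAAATAAT TTTGTTTAAC ...' into one sequence.

    Each row's leading number must equal the 1-based position of its
    first base; a mismatch signals a dropped or duplicated row.
    """
    sequence = ""
    for row in rows:
        fields = row.split()
        if not fields:
            continue  # skip blank lines
        offset = int(fields[0])
        bases = "".join(fields[1:]).upper()
        if not re.fullmatch(r"[ACGT]+", bases):
            raise ValueError(f"non-ACGT characters in row {offset}")
        if offset != len(sequence) + 1:
            raise ValueError(f"row {offset} does not follow base {len(sequence)}")
        sequence += bases
    return sequence


def listing_length(last_offset, last_row_bases):
    """Total length implied by the final row of a numbered listing."""
    return last_offset + len(last_row_bases.replace(" ", "")) - 1


# First two rows of the pDEST16 listing.
rows = [
    "1 AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG AGACCACAAC GGTTTCCCTC",
    "61 TAGAAATAAT TTTGTTTAAC TTTAAGAAGG AGATATACAT ATGAGCGATA AAATTATTCA",
]
seq = parse_numbered_listing(rows)
print(len(seq))                                   # 120
print(listing_length(6661, "CGGCGTAGAG GATCG"))   # 6675
```

Because the parser rejects anything other than A, C, G and T, a row containing scanning damage (for example a run of `I` or `1` characters in place of T) raises an error instead of silently shortening the sequence.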
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His6 Fusion in E. coli T7 Promoter 
T7 Promoter    mRNA

1 gat ccc gcg aaa tta ata cga ctc act ata|ggg aga cca caa cgg ttt ccc
  cta ggg cgc ttt aat tat gct gag tga tat ccc tct ggt gtt gcc aaa ggg

Start Translation
                                                            M   S   Y
52 tct aga aat aat ttt gtt taa ctt taa gaa gga gat ata cat atg tcg tac
   aga tct tta tta aaa caa att gaa att ctt cct cta tat gta tac agc atg

     Y   H   H   H   H   H   H   L   E   S
103 tac cat cac cat cac cat cac ctc gaa tca
    atg gta gtg gta gtg gta gtg gag ctt agt

 T   S   L   Y   K   K   A
aca agt ttg tac aaa aaa gct
tgt tca aac atg ttt ttt cga

attR1    Int

Eam1105I 3352
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pDEST17 6354 bp



Location (Base Nos.)    Gene Encoded

258..134                attR1
367..1026               CmR
1146..1230              inactivated ccdA
1368..1673              ccdB
1714..1838              attR2
2564..3421              ampR



1 CGATCCCGCG AAATTAATAC GACTCACTAT AGGGAGACCA CAACGGTTTC CCTCTAGAAA
61 TAATTTTGTT TAACTTTAAG AAGGAGATAT ACATATGTCG TACTACCATC ACCATCACCA
121 TCACCTCGAA TCAACAAGTT TGTACAAAAA AGCTGAACGA GAAACGTAAA ATGATATAAA
181 TATCAATATA TTAAATTAGA TTTTGCATAA AAAACAGACT ACATAATACT GTAAAACACA
241 ACATATCCAG TCACTATGGC GGCCGCATTA GGCACCCCAG GCTTTACACT TTATGCTTCC
301 GGCTCGTATA ATGTGTGGAT TTTGAGTTAG GATCCGTCGA GATTTTCAGG AGCTAAGGAA
361 GCTAAAATGG AGAAAAAAAT CACTGGATAT ACCACCGTTG ATATATCCCA ATGGCATCGT
421 AAAGAACATT TTGAGGCATT TCAGTCAGTT GCTCAATGTA CCTATAACCA GACCGTTCAG
481 CTGGATATTA CGGCCTTTTT AAAGACCGTA AAGAAAAATA AGCACAAGTT TTATCCGGCC
541 TTTATTCACA TTCTTGCCCG CCTGATGAAT GCTCATCCGG AATTCCGTAT GGCAATGAAA
601 GACGGTGAGC TGGTGATATG GGATAGTGTT CACCCTTGTT ACACCGTTTT CCATGAGCAA
661 ACTGAAACGT TTTCATCGCT CTGGAGTGAA TACCACGACG ATTTCCGGCA GTTTCTACAC
721 ATATATTCGC AAGATGTGGC GTGTTACGGT GAAAACCTGG CCTATTTCCC TAAAGGGTTT
781 ATTGAGAATA TGTTTTTCGT CTCAGCCAAT CCCTGGGTGA GTTTCACCAG TTTTGATTTA
841 AACGTGGCCA ATATGGACAA CTTCTTCGCC CCCGTTTTCA CCATGGGCAA ATATTATACG
901 CAAGGCGACA AGGTGCTGAT GCCGCTGGCG ATTCAGGTTC ATCATGCCGT CTGTGATGGC
961 TTGCATGTCG GCAGAATGCT TAATGAATTA CAACAGTACT GCGATGAGTG GCAGGGCGGG
1021 GCGTAAAGAT CTGGATCCGG CTTACTAAAA GCCAGATAAC AGTATGCGTA TTTGCGCGCT
1081 GATTTTTGCG GTATAAGAAT ATATACTGAT ATGTATACCC GAAGTATGTC AAAAAGAGGT
1141 GTGCTATGAA GCAGCGTATT ACAGTGACAG TTGACAGCGA CAGCTATCAG TTGCTCAAGG
1201 CATATATGAT GTCAATATCT CCGGTCTGGT AAGCACAACC ATGCAGAATG AAGCCCGTCG
1261 TCTGCGTGCC GAACGCTGGA AAGCGGAAAA TCAGGAAGGG ATGGCTGAGG TCGCCCGGTT
1321 TATTGAAATG AACGGCTCTT TTGCTGACGA GAACAGGGAC TGGTGAAATG CAGTTTAAGG
1381 TTTACACCTA TAAAAGAGAG AGCCGTTATC GTCTGTTTGT GGATGTACAG AGTGATATTA
1441 TTGACACGCC CGGGCGACGG ATGGTGATCC CCCTGGCCAG TGCACGTCTG CTGTCAGATA
1501 AAGTCTCCCG TGAACTTTAC CCGGTGGTGC ATATCGGGGA TGAAAGCTGG CGCATGATGA
1561 CCACCGATAT GGCCAGTGTG CCGGTCTCCG TTATCGGGGA AGAAGTGGCT GATCTCAGCC
1621 ACCGCGAAAA TGACATCAAA AACGCCATTA ACCTGATGTT CTGGGGAATA TAAATGTCAG
1681 GCTCCCTTAT ACACAGCCAG TCTGCAGGTC GACCATAGTG ACTGGATATG TTGTGTTTTA
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1741 CAGTATTATG TAGTCTGTTT TTTATGCAAA ATCTAATTTA ATATATTGAT ATTTATATCA
1801 TTTTACGTTT CTCGTTCAGC TTTCTTGTAC AAAGTGGTTG ATTCGAGGCT GCTAACAAAG
1861 CCCGAAAGGA AGCTGAGTTG GCTGCTGCCA CCGCTGAGCA ATAACTAGCA TAACCCCTTG
1921 GGGCCTCTAA ACGGGTCTTG AGGGGTTTTT TGCTGAAAGG AGGAACTATA TCCGGATATC
1981 CACAGGACGG GTGTGGTCGC CATGATCGCG TAGTCGATAG TGGCTCCAAG TAGCGAAGCG
2041 AGCAGGACTG GGCGGCGGCC AAAGCGGTCG GACAGTGCTC CGAGAACGGG TGCGCATAGA
2101 AATTGCATCA ACGCATATAG CGCTAGCAGC ACGCCATAGT GACTGGCGAT GCTGTCGGAA
2161 TGGACGATAT CCCGCAAGAG GCCCGGCAGT ACCGGCATAA CCAAGCCTAT GCCTACAGCA
2221 TCCAGGGTGA CGGTGCCGAG GATGACGATG AGCGCATTGT TAGATTTCAT ACACGGTGCC
2281 TGACTGCGTT AGCAATTTAA CTGTGATAAA CTACCGCATT AAAGCTTATC GATGATAAGC
2341 TGTCAAACAT GAGAATTCTT GAAGACGAAA GGGCCTCGTG ATACGCCTAT TTTTATAGGT
2401 TAATGTCATG ATAATAATGG TTTCTTAGAC GTCAGGTGGC ACTTTTCGGG GAAATGTGCG
2461 CGGAACCCCT ATTTGTTTAT TTTTCTAAAT ACATTCAAAT ATGTATCCGC TCATGAGACA
2521 ATAACCCTGA TAAATGCTTC AATAATATTG AAAAAGGAAG AGTATGAGTA TTCAACATTT
2581 CCGTGTCGCC CTTATTCCCT TTTTTGCGGC ATTTTGCCTT CCTGTTTTTG CTCACCCAGA
2641 AACGCTGGTG AAAGTAAAAG ATGCTGAAGA TCAGTTGGGT GCACGAGTGG GTTACATCGA
2701 ACTGGATCTC AACAGCGGTA AGATCCTTGA GAGTTTTCGC CCCGAAGAAC GTTTTCCAAT
2761 GATGAGCACT TTTAAAGTTC TGCTATGTGG CGCGGTATTA TCCCGTGTTG ACGCCGGGCA
2821 AGAGCAACTC GGTCGCCGCA TACACTATTC TCAGAATGAC TTGGTTGAGT ACTCACCAGT
2881 CACAGAAAAG CATCTTACGG ATGGCATGAC AGTAAGAGAA TTATGCAGTG CTGCCATAAC
2941 CATGAGTGAT AACACTGCGG CCAACTTACT TCTGACAACG ATCGGAGGAC CGAAGGAGCT
3001 AACCGCTTTT TTGCACAACA TGGGGGATCA TGTAACTCGC CTTGATCGTT GGGAACCGGA
3061 GCTGAATGAA GCCATACCAA ACGACGAGCG TGACACCACG ATGCCTGCAG CAATGGCAAC
3121 AACGTTGCGC AAACTATTAA CTGGCGAACT ACTTACTCTA GCTTCCCGGC AACAATTAAT
3181 AGACTGGATG GAGGCGGATA AAGTTGCAGG ACCACTTCTG CGCTCGGCCC TTCCGGCTGG
3241 CTGGTTTATT GCTGATAAAT CTGGAGCCGG TGAGCGTGGG TCTCGCGGTA TCATTGCAGC
3301 ACTGGGGCCA GATGGTAAGC CCTCCCGTAT CGTAGTTATC TACACGACGG GGAGTCAGGC
3361 AACTATGGAT GAACGAAATA GACAGATCGC TGAGATAGGT GCCTCACTGA TTAAGCATTG
3421 GTAACTGTCA GACCAAGTTT ACTCATATAT ACTTTAGATT GATTTAAAAC TTCATTTTTA
3481 ATTTAAAAGG ATCTAGGTGA AGATCCTTTT TGATAATCTC ATGACCAAAA TCCCTTAACG
3541 TGAGTTTTCG TTCCACTGAG CGTCAGACCC CGTAGAAAAG ATCAAAGGAT CTTCTTGAGA
3601 TCCTTTTTTT CTGCGCGTAA TCTGCTGCTT GCAAACAAAA AAACCACCGC TACCAGCGGT
3661 GGTTTGTTTG CCGGATCAAG AGCTACCAAC TCTTTTTCCG AAGGTAACTG GCTTCAGCAG
3721 AGCGCAGATA CCAAATACTG TCCTTCTAGT GTAGCCGTAG TTAGGCCACC ACTTCAAGAA
3781 CTCTGTAGCA CCGCCTACAT ACCTCGCTCT GCTAATCCTG TTACCAGTGG CTGCTGCCAG
3841 TGGCGATAAG TCGTGTCTTA CCGGGTTGGA CTCAAGACGA TAGTTACCGG ATAAGGCGCA
3901 GCGGTCGGGC TGAACGGGGG GTTCGTGCAC ACAGCCCAGC TTGGAGCGAA CGACCTACAC
3961 CGAACTGAGA TACCTACAGC GTGAGCTATG AGAAAGCGCC ACGCTTCCCG AAGGGAGAAA
4021 GGCGGACAGG TATCCGGTAA GCGGCAGGGT CGGAACAGGA GAGCGCACGA GGGAGCTTCC
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4081 AGGGGGAAAC GCCTGGTATC TTTATAGTCC TGTCGGGTTT CGCCACCTCT GACTTGAGCG
4141 TCGATTTTTG TGATGCTCGT CAGGGGGGCG GAGCCTATGG AAAAACGCCA GCAACGCGGC 
4201 CTTTTTACGG TTCCTGGCCT TTTGCTGGCC TTTTGCTCAC ATGTTCTTTC CTGCGTTATC 
4261 CCCTGATTCT GTGGATAACC GTATTACCGC CTTTGAGTGA GCTGATACCG CTCGCCGCAG 
4321 CCGAACGACC GAGCGCAGCG AGTCAGTGAG CGAGGAAGCG GAAGAGCGCC TGATGCGGTA 
4381 TTTTCTCCTT ACGCATCTGT GCGGTATTTC ACACCGCATA TATGGTGCAC TCTCAGTACA 
4441 ATCTGCTCTG ATGCCGCATA GTTAAGCCAG TATACACTCC GCTATCGCTA CGTGACTGGG 
4501 TCATGGCTGC GCCCCGACAC CCGCCAACAC CCGCTGACGC GCCCTGACGG GCTTGTCTGC 
4561 TCCCGGCATC CGCTTACAGA CAAGCTGTGA CCGTCTCCGG GAGCTGCATG TGTCAGAGGT 
4621 TTTCACCGTC ATCACCGAAA CGCGCGAGGC AGCTGCGGTA AAGCTCATCA GCGTGGTCGT 
4681 GAAGCGATTC ACAGATGTCT GCCTGTTCAT CCGCGTCCAG CTCGTTGAGT TTCTCCAGAA 
4741 GCGTTAATGT CTGGCTTCTG ATAAAGCGGG CCATGTTAAG GGCGGTTTTT TCCTGTTTGG 
4801 TCACTGATGC CTCCGTGTAA GGGGGATTTC TGTTCATGGG GGTAATGATA CCGATGAAAC 
4861 GAGAGAGGAT GCTCACGATA CGGGTTACTG ATGATGAACA TGCCCGGTTA CTGGAACGTT 
4921 GTGAGGGTAA ACAACTGGCG GTATGGATGC GGCGGGACCA GAGAAAAATC ACTCAGGGTC 
4981 AATGCCAGCG CTTCGTTAAT ACAGATGTAG GTGTTCCACA GGGTAGCCAG CAGCATCCTG 
5041 CGATGCAGAT CCGGAACATA ATGGTGCAGG GCGCTGACTT CCGCGTTTCC AGACTTTACG 
5101 AAACACGGAA ACCGAAGACC ATTCATGTTG TTGCTCAGGT CGCAGACGTT TTGCAGCAGC 
5161 AGTCGCTTCA CGTTCGCTCG CGTATCGGTG ATTCATTCTG CTAACCAGTA AGGCAACCCC 
5221 GCCAGCCTAG CCGGGTCCTC AACGACAGGA GCACGATCAT GCGCACCCGT GGCCAGGACC 
5281 CAACGCTGCC CGAGATGCGC CGCGTGCGGC TGCTGGAGAT GGCGGACGCG ATGGATATGT 
5341 TCTGCCAAGG GTTGGTTTGC GCATTCACAG TTCTCCGCAA GAATTGATTG GCTCCAATTC 
5401 TTGGAGTGGT GAATCCGTTA GCGAGGTGCC GCCGGCTTCC ATTCAGGTCG AGGTGGCCCG 
5461 GCTCCATGCA CCGCGACGCA ACGCGGGGAG GCAGACAAGG TATAGGGCGG CGCCTACAAT 
5521 CCATGCCAAC CCGTTCCATG TGCTCGCCGA GGCGGCATAA ATCGCCGTGA CGATCAGCGG 
5581 TCCAGTGATC GAAGTTAGGC TGGTAAGAGC CGCGAGCGAT CCTTGAAGCT GTCCCTGATG 
5641 GTCGTCATCT ACCTGCCTGG ACAGCATGGC CTGCAACGCG GGCATCCCGA TGCCGCCGGA 
5701 AGCGAGAAGA ATCATAATGG GGAAGGCCAT CCAGCCTCGC GTCGCGAACG CCAGCAAGAC 
5761 GTAGCCCAGC GCGTCGGCCG CCATGCCGGC GATAATGGCC TGCTTCTCGC CGAAACGTTT 
5821 GGTGGCGGGA CCAGTGACGA AGGCTTGAGC GAGGGCGTGC AAGATTCCGA ATACCGCAAG 
5881 CGACAGGCCG ATCATCGTCG CGCTCCAGCG AAAGCGGTCC TCGCCGAAAA TGACCCAGAG 
5941 CGCTGCCGGC ACCTGTCCTA CGAGTTGCAT GATAAAGAAG ACAGTCATAA GTGCGGCGAC 
6001 GATAGTCATG CCCCGCGCCC ACCGGAAGGA GCTGACTGGG TTGAAGGCTC TCAAGGGCAT 
6061 CGGTCGATCG ACGCTCTCCC TTATGCGACT CCTGCATTAG GAAGCAGCCC AGTAGTAGGT 
6121 TGAGGCCGTT GAGCACCGCC GCCGCAAGGA ATGGTGCATG CAAGGAGATG GCGCCCAACA 
6181 GTCCCCCGGC CACGGGGCCT GCCACCATAC CCACGCCGAA ACAAGCGCTC ATGAGCCCGA 
6241 AGTGGCGAGC CCGATCTTCC CCATCGGTGA TGTCGGCGAT ATAGGCGCCA GCAACCGCAC 
6301 CTGTGGCGCC GGTGATGCCG GCCACGATGC GTCCGGCGTA GAGGATCGAG ATCT 
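
The His6 fusion encoded at the start of this vector can be checked by translating the listing from the ATG in its second row (positions 61 to 120) through the first codons of attR1. The sketch below uses the standard genetic code; `translate` and `leader` are illustrative names, and the 60-base `leader` string was copied by hand from the second and third rows of the listing above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a DNA string in frame 1, ignoring any trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# ATG start codon through the first attR1 codons of the pDEST17 listing.
leader = "ATGTCGTACTACCATCACCATCACCATCACCTCGAATCAACAAGTTTGTACAAAAAAGCT"
print(translate(leader))  # MSYYHHHHHHLESTSLYKKA
```

The six consecutive H residues are the His6 tag, and the trailing T S L Y K K A matches the amino acids drawn over the attR1 junction in the thioredoxin fusion figure.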
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FastBac Transfer Vector with p10 Baculovirus Promoter

1 gaagacctcg gccgtcgcgg cgcttgccgg tggtgctgac cccggatgaa gtggttcgca
  cttctggagc cggcagcgcc gcgaacggcc accacgactg gggcctactt caccaagcgt

61 tcctcggttt tctggaaggc gagcatcgtt tgttcgccca ggactctagc tatagttcta
   aggagccaaa agaccttccg ctcgtagcaa acaagcgggt cctgagatcg atatcaagat

121 gtggttggct acgtatcgag caagaaaata aaacgccaaa cgcgttggag tcttgtgtgc
    caccaaccga tgcatagctc gttcttttat tttgcggttt gcgcaacctc agaacacacg

p10 Promoter

181 tatttttaca aagattcaga aatacgcatc acttacaaca agggggacta tgaaattatg
    ataaaaatgt ttctaagtct ttatgcgtag tgaatgttgt tccccctgat actttaatac

241 cattttgagg atgccgggac ctttaattca acccaacaca atatattata gttaaataag
    gtaaaactcc tacggccctg gaaattaagt tgggttgtgt tatataatat caatttattc

mRNA

301 aattatttat caaatcattt gtatattaat taaaatacta tactgtaaat tacattttat
    ttaataaata gtttagtaaa catataatta attttatgat atgacattta atgtaaaata

361 ttacaatgag gatcatcaca agtttgtaca aaaaagctga acgagaaacg taaaatgata
    aatgttactc ctagtagtgt tcaaacatgt tttttcgact tgctctttgc attttactat

Int    attR1

Restriction sites on the plasmid map: BlnI, SalI, HpaI, BclI, BspMI, MunI, NaeI, NgoMIV, Ban, XmaI, SmaI, SrfI, BbvCI, Bst1107I, BssHII, BamHI, NcoI
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pDEST18 6613 bp 



Location (Base Nos.)    Gene Encoded

474..1449               ampR
1590..2244              ori
2738..3850              genR
4251..4127              attR1
4501..5160              CmR
5280..5364              inactivated ccdA
5502..5807              ccdB
5848..5972              attR2
6595..25                lacZ
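
Two of the location entries above have a start greater than the end. Pairing locations and gene names in the order given, attR1 (4251..4127) appears to be written in reverse orientation, while lacZ (6595..25) appears to run through base 1 of the 6613 bp circle. That reading is an assumption, supported only by the lengths it yields. A short Python sketch (the function name `span_length` and its `wraps_origin` flag are illustrative, not from the source):

```python
def span_length(start, end, plasmid_len, wraps_origin=False):
    """Bases covered by a 1-based, inclusive 'start..end' entry.

    start <= end            : ordinary forward span.
    start > end, no wrap    : span written in reverse orientation.
    start > end, wraps=True : span that runs through base 1 of a
                              circular map of plasmid_len bases.
    """
    if start <= end:
        return end - start + 1
    if wraps_origin:
        return plasmid_len - start + 1 + end
    return start - end + 1


PDEST18_LEN = 6613

print(span_length(4251, 4127, PDEST18_LEN))                   # attR1: 125
print(span_length(5848, 5972, PDEST18_LEN))                   # attR2: 125
print(span_length(6595, 25, PDEST18_LEN, wraps_origin=True))  # lacZ: 44
print(span_length(2738, 3850, PDEST18_LEN))                   # genR: 1113
```

Under this reading attR1 and attR2 are both 125 bases long, which is what one would expect for the two recombination sites flanking the cassette.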



1 GACGCGCCCT GTAGCGGCGC ATTAAGCGCG GCGGGTGTGG TGGTTACGCG CAGCGTGACC
61 GCTACACTTG CCAGCGCCCT AGCGCCCGCT CCTTTCGCTT TCTTCCCTTC CTTTCTCGCC 
121 ACGTTCGCCG GCTTTCCCCG TCAAGCTCTA AATCGGGGGC TCCCTTTAGG GTTCCGATTT 
181 AGTGCTTTAC GGCACCTCGA CCCCAAAAAA CTTGATTAGG GTGATGGTTC ACGTAGTGGG 
241 CCATCGCCCT GATAGACGGT TTTTCGCCCT TTGACGTTGG AGTCCACGTT CTTTAATAGT 
301 GGACTCTTGT TCCAAACTGG AACAACACTC AACCCTATCT CGGTCTATTC TTTTGATTTA 
361 TAAGGGATTT TGCCGATTTC GGCCTATTGG TTAAAAAATG AGCTGATTTA ACAAAAATTT 
421 AACGCGAATT TTAACAAAAT ATTAACGTTT ACAATTTCAG GTGGCACTTT TCGGGGAAAT 
481 GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG 
541 AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA GGAAGAGTAT GAGTATTCAA 
601 CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTGCTCAC
661 CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC 
721 ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT 
781 CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG TATTGACGCC 
841 GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA 
901 CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG CAGTGCTGCC 
961 ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA CAACGATCGG AGGACCGAAG
1021 GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA
1081 CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGCGTGACA CCACGATGCC TGTAGCAATG 
1141 GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA 
1201 TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG 
1261 GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT 
1321 GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG TTATCTACAC GACGGGGAGT 
1381 CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG 
1441 CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT AAAACTTCAT
1501 TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT 
1561 TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT
1621 TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA
1681 GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC 
1741 AGCAGAGCGC AGATACCAAA TACTGTCCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC 
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1801 AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT 
1861 GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG 
1921 GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC 
1981 TACACCGAAC TGAGATACCT ACAGCGTGAG CATTGAGAAA GCGCCACGCT TCCCGAAGGG 
2041 AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG 
2101 CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT
2161 GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC
2221 GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTTCCTGCG 
2281 TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA TACCGCTCGC 
2341 CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCTGATG
2401 CGGTATTTTC TCCTTACGCA TCTGTGCGGT ATTTCACACC GCAGACCAGC CGCGTAACCT 
2461 GGCAAAATCG GTTACGGTTG AGTAATAAAT GGATGCCCTG CGTAAGCGGG TGTGGGCGGA 
2521 CAATAAAGTC TTAAACTGAA CAAAATAGAT CTAAACTATG ACAATAAAGT CTTAAACTAG 
2581 ACAGAATAGT TGTAAACTGA AATCAGTCCA GTTATGCTGT GAAAAAGCAT ACTGGACTTT 
2641 TGTTATGGCT AAAGCAAACT CTTCATTTTC TGAAGTGCAA ATTGCCCGTC GTATTAAAGA 
2701 GGGGCGTGGC CAAGGGCATG GTAAAGACTA TATTCGCGGC GTTGTGACAA TTTACCGAAC 
2761 AACTCCGCGG CCGGGAAGCC GATCTCGGCT TGAACGAATT GTTAGGTGGC GGTACTTGGG 
2821 TCGATATCAA AGTGCATCAC TTCTTCCCGT ATGCCCAACT TTGTATAGAG AGCCACTGCG 
2881 GGATCGTCAC CGTAATCTGC TTGCACGTAG ATCACATAAG CACCAAGCGC GTTGGCCTCA 
2941 TGCTTGAGGA GATTGATGAG CGCGGTGGCA ATGCCCTGCC TCCGGTGCTC GCCGGAGACT 
3001 GCGAGATCAT AGATATAGAT CTCACTACGC GGCTGCTCAA ACCTGGGCAG AACGTAAGCC 
3061 GCGAGAGCGC CAACAACCGC TTCTTGGTCG AAGGCAGCAA GCGCGATGAA TGTCTTACTA 
3121 CGGAGCAAGT TCCCGAGGTA ATCGGAGTCC GGCTGATGTT GGGAGTAGGT GGCTACGTCT 
3181 CCGAACTCAC GACCGAAAAG ATCAAGAGCA GCCCGCATGG ATTTGACTTG GTCAGGGCCG 
3241 AGCCTACATG TGCGAATGAT GCCCATACTT GAGCCACCTA ACTTTGTTTT AGGGCGACTG 
3301 CCCTGCTGCG TAACATCGTT GCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCAAACA 
3361 TCGACCCACG GCGTAACGCG CTTGCTGCTT GGATGCCCGA GGCATAGACT GTACAAAAAA 
3421 ACAGTCATAA CAAGCCATGA AAACCGCCAC TGCGCCGTTA CCACCGCTGC GTTCGGTCAA 
3481 GGTTCTGGAC CAGTTGCGTG AGCGCATACG CTACTTGCAT TACAGTTTAC GAACCGAACA 
3541 GGCTTATGTC AACTGGGTTC GTGCCTTCAT CCGTTTCCAC GGTGTGCGTC ACCCGGCAAC 
3601 CTTGGGCAGC AGCGAAGTCG AGGCATTTCT GTCCTGGCTG GCGAACGAGC GCAAGGTTTC 
3661 GGTCTCCACG CATCGTCAGG CATTGGCGGC CTTGCTGTTC TTCTACGGCA AGGTGCTGTG 
3721 CACGGATCTG CCCTGGCTTC AGGAGATCGG AAGACCTCGG CCGTCGCGGC GCTTGCCGGT 
3781 GGTGCTGACC CCGGATGAAG TGGTTCGCAT CCTCGGTTTT CTGGAAGGCG AGCATCGTTT 
3841 GTTCGCCCAG GACTCTAGCT ATAGTTCTAG TGGTTGGCTA CGTATCGAGC AAGAAAATAA 
3901 AACGCCAAAC GCGTTGGAGT CTTGTGTGCT ATTTTTACAA AGATTCAGAA ATACGCATCA 
3961 CTTACAACAA GGGGGACTAT GAAATTATGC ATTTTGAGGA TGCCGGGACC TTTAATTCAA 
4021 CCCAACACAA TATATTATAG TTAAATAAGA ATTATTTATC AAATCATTTG TATATTAATT 
4081 AAAATACTAT ACTGTAAATT ACATTTTATT TACAATGAGG ATCATCACAA GTTTGTACAA 
4141 AAAAGCTGAA CGAGAAACGT AAAATGATAT AAATATCAAT ATATTAAATT AGATTTTGCA 
4201 TAAAAAACAG ACTACATAAT ACTGTAAAAC ACAACATATC CAGTCACTAT GGCGGCCGCT 
4261 AAGTTGGCAG CATCACCCGA CGCACTTTGC GCCGAATAAA TACCTGTGAC GGAAGATCAC 
4321 TTCGCAGAAT AAATAAATCC TGGTGTCCCT GTTGATACCG GGAAGCCCTG GGCCAACTTT 
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4381 TGGCGAAAAT GAGACGTTGA TCGGCACGTA AGAGGTTCCA ACTTTCACCA TAATGAAATA 
4441 AGATCACTAC CGGGCGTATT TTTTGAGTTA TCGAGATTTT CAGGAGCTAA GGAAGCTAAA 
4501 ATGGAGAAAA AAATCACTGG ATATACCACC GTTGATATAT CCCAATGGCA TCGTAAAGAA 
4561 CATTTTGAGG CATTTCAGTC AGTTGCTCAA TGTACCTATA ACCAGACCGT TCAGCTGGAT 
4621 ATTACGGCCT TTTTAAAGAC CGTAAAGAAA AATAAGCACA AGTTTTATCC GGCCTTTATT 
4681 CACATTCTTG CCCGCCTGAT GAATGCTCAT CCGGAATTCC GTATGGCAAT GAAAGACGGT 
4741 GAGCTGGTGA TATGGGATAG TGTTCACCCT TGTTACACCG TTTTCCATGA GCAAACTGAA 
4801 ACGTTTTCAT CGCTCTGGAG TGAATACCAC GACGATTTCC GGCAGTTTCT ACACATATAT 
4861 TCGCAAGATG TGGCGTGTTA CGGTGAAAAC CTGGCCTATT TCCCTAAAGG GTTTATTGAG 
4921 AATATGTTTT TCGTCTCAGC CAATCCCTGG GTGAGTTTCA CCAGTTTTGA TTTAAACGTG 
4981 GCCAATATGG ACAACTTCTT CGCCCCCGTT TTCACCATGG GCAAATATTA TACGCAAGGC 
5041 GACAAGGTGC TGATGCCGCT GGCGATTCAG GTTCATCATG CCGTCTGTGA TGGCTTCCAT 
5101 GTCGGCAGAA TGCTTAATGA ATTACAACAG TACTGCGATG AGTGGCAGGG CGGGGCGTAA 
5161 ACGCGTGGAT CCGGCTTACT AAAAGCCAGA TAACAGTATG CGTATTTGCG CGCTGATTTT 
5221 TGCGGTATAA GAATATATAC TGATATGTAT ACCCGAAGTA TGTCAAAAAG AGGTGTGCTA 
5281 TGAAGCAGCG TATTACAGTG ACAGTTGACA GCGACAGCTA TCAGTTGCTC AAGGCATATA 
5341 TGATGTCAAT ATCTCCGGTC TGGTAAGCAC AACCATGCAG AATGAAGCCC GTCGTCTGCG 
5401 TGCCGAACGC TGGAAAGCGG AAAATCAGGA AGGGATGGCT GAGGTCGCCC GGTTTATTGA 
5461 AATGAACGGC TCTTTTGCTG ACGAGAACAG GGACTGGTGA AATGCAGTTT AAGGTTTACA 
5521 CCTATAAAAG AGAGAGCCGT TATCGTCTGT TTGTGGATGT ACAGAGTGAT ATTATTGACA 
5581 CGCCCGGGCG ACGGATGGTG ATCCCCCTGG CCAGTGCACG TCTGCTGTCA GATAAAGTCT 
5641 CCCGTGAACT TTACCCGGTG GTGCATATCG GGGATGAAAG CTGGCGCATG ATGACCACCG 
5701 ATATGGCCAG TGTGCCGGTC TCCGTTATCG GGGAAGAAGT GGCTGATCTC AGCCACCGCG 
5761 AAAATGACAT CAAAAACGCC ATTAACCTGA TGTTCTGGGG AATATAAATG TCAGGCTCCC 
5821 TTATACACAG CCAGTCTGCA GGTCGACCAT AGTGACTGGA TATGTTGTGT TTTACAGTAT 
5881 TATGTAGTCT GTTTTTTATG CAAAATCTAA TTTAATATAT TGATATTTAT ATCATTTTAC
5941 GTTTCTCGTT CAGCTTTCTT GTACAAAGTG GTGATAGCTT GTCGAGAAGT ACTAGAGGAT 
6001 CATAATCAGC CATACCACAT TTGTAGAGGT TTTACTTGCT TTAAAAAACC TCCCACACCT 
6061 CCCCCTGAAC CTGAAACATA AAATGAATGC AATTGTTGTT GTTAACTTGT TTATTGCAGC 
6121 TTATAATGGT TACAAATAAA GCAATAGCAT CACAAATTTC ACAAATAAAG CATTTTTTTC
6181 ACTGCATTCT AGTTGTGGTT TGTCCAAACT CATCAATGTA TCTTATCATG TCTGGATCTG 
6241 ATCACTGCTT GAGCCTAGGA GATCCGAACC AGATAAGTGA AATCTAGTTC CAAACTATTT
6301 TGTCATTTTT AATTTTCGTA TTAGCTTACG ACGCTACACC CAGTTCCCAT CTATTTTGTC 
6361 ACTCTTCCCT AAATAATCCT TAAAAACTCC ATTTCCACCC CTCCCAGTTC CCAACTATTT 
6421 TGTCCGCCCA CAGCGGGGCA TTTTTCTTCC TGTTATGTTT TTAATCAAAC ATCCTGCCAA 
6481 CTCCATGTGA CAAACCGTCA TCTTCGGCTA CTTTTTCTCT GTCACAGAAT GAAAATTTTT 
6541 CTGTCATCTC TTCGTTATTA ATGTTTGTAA TTGACTGAAT ATCAACGCTT ATTTGCAGCC 
6601 TGAATGGCGA ATG 
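
The promoter figure for this vector prints the lower strand directly beneath the upper one, so each lower base is the Watson-Crick partner of the base above it: a plain complement read left to right, not a reverse complement. A small Python check follows, using the first 60 bases of the figure as they read in the listing above (bases 3750 to 3809); the helper names are illustrative.

```python
COMPLEMENT = str.maketrans("acgt", "tgca")


def complement(strand):
    """Base-by-base complement, keeping the left-to-right order."""
    return strand.lower().replace(" ", "").translate(COMPLEMENT)


def reverse_complement(strand):
    """Complement read in the opposite direction (the other strand, 5' to 3')."""
    return complement(strand)[::-1]


upper = "gaagacctcg gccgtcgcgg cgcttgccgg tggtgctgac cccggatgaa gtggttcgca"
lower = "cttctggagc cggcagcgcc gcgaacggcc accacgactg gggcctactt caccaagcgt"

print(complement(upper) == lower.replace(" ", ""))  # True
```

The same comparison can be used to resolve ambiguous characters in a scanned two-strand figure: where one strand is unreadable, the base on the other strand determines it.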
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FastBac Transfer Vector with 39K Baculovirus Promoter

1 ggtgacgccg tcatctttcc attgtaacgt aaatggcaac ttgtagatga acgcgctgtc
  ccactgcggc agtagaaagg taacattgca tttaccgttg aacatctact tgcgcgacag

61 aaaaaaccgg ccagtttctt ccacaaactc gcgcacggct gtctcgtaaa cttttgcgtc
   ttttttggcc ggtcaaagaa ggtgtttgag cgcgtgccga cagagcattt gaaaacgcag

39K Promoter

121 gcaacaatcg cgatgacctc gtggtatgga aattttttct aaaaaagtgt cgttcatgtc
    cgttgttagc gctactggag caccatacct ttaaaaaaga ttttttcaca gcaagtacag

181 ggcggcggcg ttcgcgctcc ggtacgcgcg acgggcacac agcaggacag ccttgtccgg
    ccgccgccgc aagcgcgagg ccatgcgcgc tgcccgtgtg tcgtcctgtc ggaacaggcc

attR1

241 ctcgattatc ataaacaatc ctgcaggcat gcaagctgga tcatcacaag ttt|gtacaaa
    gagctaatag tatttgttag gacgtccgta cgttcgacct agtag|tgttc aaacatgttt

SnaBI

PstI

Int

EcoRV



Real 
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pDEST19 6668 bp (rotated to position 1000) 



Location (Base Nos.)    Gene Encoded

515..391                attR1
765..1424               CmR
1544..1628              inactivated ccdA
1766..2071              ccdB
2112..2236              attR2
2852..2895              lacZ
3344..4319              ampR
4460..5114              ori
5608..52                genR



1 AGTGGTTCGC ATCCTCGGTT TTCTGGAAGG CGAGCATCGT TTGTTCGCCC AGGACTCTAG 
61 CTATAGTTCT AGTGGTTGGC TACGTATATC AAATACTTGT AGGTGACGCC GTCATCTTTC 
121 CATTGTAACG TAAATGGCAA CTTGTAGATG AACGCGCTGT CAAAAAACCG GCCAGTTTCT 
181 TCCACAAACT CGCGCACGGC TGTCTCGTAA ACTTTTGCGT CGCAACAATC GCGATGACCT 
241 CGTGGTATGG AAATTTTTTC TAAAAAAGTG TCGTTCATGT CGGCGGCGGG CGCGTTCGCG
301 CTCCGGTACG CGCGACGGGC ACACAGCAGG ACAGCCTTGT CCGGCTCGAT TATCATAAAC 
361 AATCCTGCAG GCATGCAAGC TCGGATCATC ACAAGTTTGT ACAAAAAAGC TGAACGAGAA 
421 ACGTAAAATG ATATAAATAT CAATATATTA AATTAGATTT TGCATAAAAA ACAGACTACA 
481 TAATACTGTA AAACACAACA TATCCAGTCA CTATGGCGGC CGCTAAGTTG GCAGCATCAC 
541 CCGACGCACT TTGCGCCGAA TAAATACCTG TGACGGAAGA TCACTTCGCA GAATAAATAA 
601 ATCCTGGTGT CCCTGTTGAT ACCGGGAAGC CCTGGGCCAA CTTTTGGCGA AAATGAGACG 
661 TTGATCGGCA CGTAAGAGGT TCCAACTTTC ACCATAATGA AATAAGATCA CTACCGGGCG 
721 TATTTTTTGA GTTATCGAGA TTTTCAGGAG CTAAGGAAGC TAAAATGGAG AAAAAAATCA
781 CTGGATATAC CACCGTTGAT ATATCCCAAT GGCATCGTAA AGAACATTTT GAGGCATTTC 
841 AGTCAGTTGC TCAATGTACC TATAACCAGA CCGTTCAGCT GGATATTACG GCCTTTTTAA 
901 AGACCGTAAA GAAAAATAAG CACAAGTTTT ATCCGGCCTT TATTCACATT CTTGCCCGCC 
961 TGATGAATGC TCATCCGGAA TTCCGTATGG CAATGAAAGA CGGTGAGCTG GTGATATGGG 
1021 ATAGTGTTCA CCCTTGTTAC ACCGTTTTCC ATGAGCAAAC TGAAACGTTT TCATCGCTCT 
1081 GGAGTGAATA CCACGACGAT TTCCGGCAGT TTCTACACAT ATATTCGCAA GATGTGGCGT 
1141 GTTACGGTGA AAACCTGGCC TATTTCCCTA AAGGGTTTAT TGAGAATATG TTTTTCGTCT 
1201 CAGCCAATCC CTGGGTGAGT TTCACCAGTT TTGATTTAAA CGTGGCCAAT ATGGACAACT 
1261 TCTTCGCCCC CGTTTTCACC ATGGGCAAAT ATTATACGCA AGGCGACAAG GTGCTGATGC 
1321 CGCTGGCGAT TCAGGTTCAT CATGCCGTCT GTGATGGCTT CCATGTCGGC AGAATGCTTA 
1381 ATGAATTACA ACAGTACTGC GATGAGTGGC AGGGCGGGGC GTAAACGCGT GGATCCGGCT 
1441 TACTAAAAGC CAGATAACAG TATGCGTATT TGCGCGCTGA TTTTTGCGGT ATAAGAATAT 
1501 ATACTGATAT GTATACCCGA AGTATGTCAA AAAGAGGTGT GCTATGAAGC AGCGTATTAC 
1561 AGTGACAGTT GACAGCGACA GCTATCAGTT GCTCAAGGCA TATATGATGT CAATATCTCC 
1621 GGTCTGGTAA GCACAACCAT GCAGAATGAA GCCCGTCGTC TGCGTGCCGA ACGCTGGAAA 
1681 GCGGAAAATC AGGAAGGGAT GGCTGAGGTC GCCCGGTTTA TTGAAATGAA CGGCTCTTTT 
1741 GCTGACGAGA ACAGGGACTG GTGAAATGCA GTTTAAGGTT TACACCTATA AAAGAGAGAG 
1801 CCGTTATCGT CTGTTTGTGG ATGTACAGAG TGATATTATT GACACGCCCG GGCGACGGAT 
1861 GGTGATCCCC CTGGCCAGTG CACGTCTGCT GTCAGATAAA GTCTCCCGTG AACTTTACCC 
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1921 GGTGGTGCAT ATCGGGGATG AAAGCTGGCG CATGATGACC ACCGATATGG CCAGTGTGCC 
1981 GGTCTCCGTT ATCGGGGAAG AAGTGGCTGA TCTCAGCCAC CGCGAAAATG ACATCAAAAA 
2041 CGCCATTAAC CTGATGTTCT GGGGAATATA AATGTCAGGC TCCCTTATAC ACAGCCAGTC 
2101 TGCAGGTCGA CCATAGTGAC TGGATATGTT GTGTTTTACA GTATTATGTA GTCTGTTTTT
2161 TATGCAAAAT CTAATTTAAT ATATTGATAT TTATATCATT TTACGTTTCT CGTTCAGCTT 
2221 TCTTGTACAA AGTGGTGATC GAGAAGTACT AGAGGATCAT AATCAGCCAT ACCACATTTG 
2281 TAGAGGTTTT ACTTGCTTTA AAAAACCTCC CACACCTCCC CCTGAACCTG AAACATAAAA 
2341 TGAATGCAAT TGTTGTTGTT AACTTGTTTA TTGCAGCTTA TAATGGTTAC AAATAAAGCA 
2401 ATAGCATCAC AAATTTCACA AATAAAGCAT TTTTTTCACT GCATTCTAGT TGTGGTTTGT
2461 CCAAACTCAT CAATGTATCT TATCATGTCT GGATCTGATC ACTGCTTGAG CCTAGGAGAT 
2521 CCGAACCAGA TAAGTGAAAT CTAGTTCCAA ACTATTTTGT CATTTTTAAT TTTCGTATTA
2581 GCTTACGACG CTACACCCAG TTCCCATCTA TTTTGTCACT CTTCCCTAAA TAATCCTTAA 
2641 AAACTCCATT TCCACCCCTC CCAGTTCCCA ACTATTTTGT CCGCCCACAG CGGGGCATTT
2701 TTCTTCCTGT TATGTTTTTA ATCAAACATC CTGCCAACTC CATGTGACAA ACCGTCATCT 
2761 TCGGCTACTT TTTCTCTGTC ACAGAATGAA AATTTTTCTG TCATCTCTTC GTTATTAATG
2821 TTTGTAATTG ACTGAATATC AACGCTTATT TGCAGCCTGA ATGGCGAATG GACGCGCCCT 
2881 GTAGCGGCGC ATTAAGCGCG GCGGGTGTGG TGGTTACGCG CAGCGTGACC GCTACACTTG 
2941 CCAGCGCCCT AGCGCCCGCT CCTTTCGCTT TCTTCCCTTC CTTTCTCGCC ACGTTCGCCG 
3001 GCTTTCCCCG TCAAGCTCTA AATCGGGGGC TCCCTTTAGG GTTCCGATTT AGTGCTTTAC 
3061 GGCACCTCGA CCCCAAAAAA CTTGATTAGG GTGATGGTTC ACGTAGTGGG CCATCGCCCT
3121 GATAGACGGT TTTTCGCCCT TTGACGTTGG AGTCCACGTT CTTTAATAGT GGACTCTTGT 
3181 TCCAAACTGG AACAACACTC AACCCTATCT CGGTCTATTC TTTTGATTTA TAAGGGATTT 
3241 TGCCGATTTC GGCCTATTGG TTAAAAAATG AGCTGATTTA ACAAAAATTT AACGCGAATT 
3301 TTAACAAAAT ATTAACGTTT ACAATTTCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA
3361 CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC 
3421 CCTGATAAAT GCTTCAATAA TATTGAAAAA GGAAGAGTAT GAGTATTCAA CATTTCCGTG 
3481 TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTGCTCAC CCAGAAACGC
3541 TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC ATCGAACTGG 
3601 ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA 
3661 GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG TATTGACGCC GGGCAAGAGC 
3721 AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG 
3781 AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA 
3841 GTGATAACAC TGCGGCCAAC TTACTTCTGA CAACGATCGG AGGACCGAAG GAGCTAACCG 
3901 CTTTTTTGCA CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA 
3961 ATGAAGCCAT ACCAAACGAC GAGCGTGACA CCACGATGCC TGTAGCAATG GCAACAACGT 
4021 TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT 
4081 GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT 
4141 TTATTGCTGA TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG 
4201 GGCCAGATGG TAAGCCCTCC CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA 
4261 TGGATGAACG AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG CATTGGTAAC 
4321 TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT AAAACTTCAT TTTTAATTTA 
4381 AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT TAACGTGAGT 
4441 TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT 
4501 TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT 
4561 GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC 
4621 AGATACCAAA TACTGTCCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG 
4681 TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG 
4741 ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG GCGCAGCGGT 
4801 CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC 
4861 TGAGATACCT ACAGCGTGAG CATTGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG 
4921 ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG 
4981 GAAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT 
5041 TTTTGTGATG CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT 
5101 TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTTCCTGCG TTATCCCCTG 
5161 ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA TACCGCTCGC CGCAGCCGAA 
5221 CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCTGATG CGGTATTTTC 
5281 TCCTTACGCA TCTGTGCGGT ATTTCACACC GCAGACCAGC CGCGTAACCT GGCAAAATCG 
5341 GTTACGGTTG AGTAATAAAT GGATGCCCTG CGTAAGCGGG TGTGGGCGGA CAATAAAGTC 
5401 TTAAACTGAA CAAAATAGAT CTAAACTATG ACAATAAAGT CTTAAACTAG ACAGAATAGT 
5461 TGTAAACTGA AATCAGTCCA GTTATGCTGT GAAAAAGCAT ACTGGACTTT TGTTATGGCT 
5521 AAAGCAAACT CTTCATTTTC TGAAGTGCAA ATTGCCCGTC GTATTAAAGA GGGGCGTGGC 
5581 CAAGGGCATG GTAAAGACTA TATTCGCGGC GTTGTGACAA TTTACCGAAC AACTCCGCGG 
5641 CCGGGAAGCC GATCTCGGCT TGAACGAATT GTTAGGTGGC GGTACTTGGG TCGATATCAA 
5701 AGTGCATCAC TTCTTCCCGT ATGCCCAACT TTGTATAGAG AGCCACTGCG GGATCGTCAC 
5761 CGTAATCTGC TTGCACGTAG ATCACATAAG CACCAAGCGC GTTGGCCTCA TGCTTGAGGA 
5821 GATTGATGAG CGCGGTGGCA ATGCCCTGCC TCCGGTGCTC GCCGGAGACT GCGAGATCAT 
5881 AGATATAGAT CTCACTACGC GGCTGCTCAA ACCTGGGCAG AACGTAAGCC GCGAGAGCGC 
5941 CAACAACCGC TTCTTGGTCG AAGGCAGCAA GCGCGATGAA TGTCTTACTA CGGAGCAAGT 
6001 TCCCGAGGTA ATCGGAGTCC GGCTGATGTT GGGAGTAGGT GGCTACGTCT CCGAACTCAC 
6061 GACCGAAAAG ATCAAGAGCA GCCCGCATGG ATTTGACTTG GTCAGGGCCG AGCCTACATG 
6121 TGCGAATGAT GCCCATACTT GAGCCACCTA ACTTTGTTTT AGGGCGACTG CCCTGCTGCG 
6181 TAACATCGTT GCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCAAACA TCGACCCACG 
6241 GCGTAACGCG CTTGCTGCTT GGATGCCCGA GGCATAGACT GTACAAAAAA ACAGTCATAA 
6301 CAAGCCATGA AAACCGCCAC TGCGCCGTTA CCACCGCTGC GTTCGGTCAA GGTTCTGGAC 
6361 CAGTTGCGTG AGCGCATACG CTACTTGCAT TACAGTTTAC GAACCGAACA GGCTTATGTC 
6421 AACTGGGTTC GTGCCTTCAT CCGTTTCCAC GGTGTGCGTC ACCCGGCAAC CTTGGGCAGC 
6481 AGCGAAGTCG AGGCATTTCT GTCCTGGCTG GCGAACGAGC GCAAGGTTTC GGTCTCCACG 
6541 CATCGTCAGG CATTGGCGGC CTTGCTGTTC TTCTACGGCA AGGTGCTGTG CACGGATCTG 
6601 CCCTGGCTTC AGGAGATCGG AAGACCTCGG CCGTCGCGGC GCTTGCCGGT GGTGCTGACC 
6661 CCGGATGA 
GLUTATHIONE-S-TRANSFERASE FUSION WITH POLYHEDRON PROMOTER FOR BACULOVIRUS EXPRESSION 

[Figure: annotated double-stranded sequence of the pDEST20 fusion junction. Labels: polh PROMOTER, mRNA, START TRANSLN., GST, Int.] 

[Figure: circular restriction map of pDEST20. Sites as labeled: Not I 131, Avr II 350, EcoR I 452, PflM I 678, Xba I 899, BssH II 946, Srf I 1320, Sma I 1321, Pst I 1574, BspM I 1576, Sal I 1580, Mun I 1830, Hpa I 1841, Avr II 1994, NgoA IV 2480, Ban II 2510, BsaH I 3188, Pvu I 3357, Fsp I 3504, Sst II 5118, EcoR V 5176, BseR I 5299, Tth111 I 5578, Bbs I 6103, SnaB I 6232, EcoN I 6397, EcoO109 I 6422, Bsg I 6656, Nsp V 6787, Swa I 6815.] 
pDEST20 7066 bp (rotated to position 5800) 

Location (Base Nos.)    Gene Encoded 
592..1263               GST 
1397..1273              attR1 
1506..2165              CmR 
2285..2369              inactivated ccdA 
2507..2812              ccdB 
2853..2977              attR2 
4214..5064              ampR 
5263..5843              ori 



1 CCACTGCGCC GTTACCACCG CTGCGTTCGG TCAAGGTTCT GGACCAGTTG CGTGAGCGCA 
61 TACGCTACTT GCATTACAGT TTACGAACCG AACAGGCTTA TGTCAACTGG GTTCGTGCCT 
121 TCATCCGTTT CCACGGTGTG CGTCACCCGG CAACCTTGGG CAGCAGCGAA GTCGAGGCAT 
181 TTCTGTCCTG GCTGGCGAAC GAGCGCAAGG TTTCGGTCTC CACGCATCGT CAGGCATTGG 
241 CGGCCTTGCT GTTCTTCTAC GGCAAGGTGC TGTGCACGGA TCTGCCCTGG CTTCAGGAGA 
301 TCGGAAGACC TCGGCCGTCG CGGCGCTTGC CGGTGGTGCT GACCCCGGAT GAAGTGGTTC 
361 GCATCCTCGG TTTTCTGGAA GGCGAGCATC GTTTGTTCGC CCAGGACTCT AGCTATAGTT 
421 CTAGTGGTTG GCTACGTATA CTCCGGAATA TTAATAGATC ATGGAGATAA TTAAAATGAT 
481 AACCATCTCG CAAATAAATA AGTATTTTAC TGTTTTCGTA ACAGTTTTGT AATAAAAAAA 
541 CCTATAAATA TTCCGGATTA TTCATACCGT CCCACCATCG GGCGCGGATC CATGGCCCCT 
601 ATACTAGGTT ATTGGAAAAT TAAGGGCCTT GTGCAACCCA CTCGACTTCT TTTGGAATAT 
661 CTTGAAGAAA AATATGAAGA GCATTTGTAT GAGCGCGATG AAGGTGATAA ATGGCGAAAC 
721 AAAAAGTTTG AATTGGGTTT GGAGTTTCCC AATCTTCCTT ATTATATTGA TGGTGATGTT 
781 AAATTAACAC AGTCTATGGC CATCATACGT TATATAGCTG ACAAGCACAA CATGTTGGGT 
841 GGTTGTCCAA AAGAGCGTGC AGAGATTTCA ATGCTTGAAG GAGCGGTTTT GGATATTAGA 
901 TACGGTGTTT CGAGAATTGC ATATAGTAAA GACTTTGAAA CTCTCAAAGT TGATTTTCTT 
961 AGCAAGCTAC CTGAAATGCT GAAAATGTTC GAAGATCGTT TATGTCATAA AACATATTTA 
1021 AATGGTGATC ATGTAACCCA TCCTGACTTC ATGTTGTATG ACGCTCTTGA TGTTGTTTTA 
1081 TACATGGACC CAATGTGCCT GGATGCGTTC CCAAAATTAG TTTGTTTTAA AAAACGTATT 
1141 GAAGCTATCC CACAAATTGA TAAGTACTTG AAATCCAGCA AGTATATAGC ATGGCCTTTG 
1201 CAGGGCTGGC AAGCCACGTT TGGTGGTGGC GACCATCCTC CAAAATCGGA TCTGGTTCCG 
1261 CGTCATAATC AAACAAGTTT GTACAAAAAA GCTGAACGAG AAACGTAAAA TGATATAAAT 
1321 ATCAATATAT TAAATTAGAT TTTGCATAAA AAACAGACTA CATAATACTG TAAAACACAA 
1381 CATATCCAGT CACTATGGCG GCCGCATTAG GCACCCCAGG CTTTACACTT TATGCTTCCG 
1441 GCTCGTATGT TGTGTGGATT TTGAGTTAGG ATCCGGCGAG ATTTTCAGGA GCTAAGGAAG 
1501 CTAAAATGGA GAAAAAAATC ACTGGATATA CCACCGTTGA TATATCCCAA TGGCATCGTA 
1561 AAGAACATTT TGAGGCATTT CAGTCAGTTG CTCAATGTAC CTATAACCAG ACCGTTCAGC 
1621 TGGATATTAC GGCCTTTTTA AAGACCGTAA AGAAAAATAA GCACAAGTTT TATCCGGCCT 
1681 TTATTCACAT TCTTGCCCGC CTGATGAATG CTCATCCGGA ATTCCGTATG GCAATGAAAG 
1741 ACGGTGAGCT GGTGATATGG GATAGTGTTC ACCCTTGTTA CACCGTTTTC CATGAGCAAA 
1801 CTGAAACGTT TTCATCGCTC TGGAGTGAAT ACCACGACGA TTTCCGGCAG TTTCTACACA 
1861 TATATTCGCA AGATGTGGCG TGTTACGGTG AAAACCTGGC CTATTTCCCT AAAGGGTTTA 
1921 TTGAGAATAT GTTTTTCGTC TCAGCCAATC CCTGGGTGAG TTTCACCAGT TTTGATTTAA 



FIG.40B 




1981 ACGTGGCCAA TATGGACAAC TTCTTCGCCC CCGTTTTCAC CATGGGCAAA TATTATACGC 
2041 AAGGCGACAA GGTGCTGATG CCGCTGGCGA TTCAGGTTCA TCATGCCGTC TGTGATGGCT 
2101 TCCATGTCGG CAGAATGCTT AATGAATTAC AACAGTACTG CGATGAGTGG CAGGGCGGGG 
2161 CGTAATCTAG AGGATCCGGC TTACTAAAAG CCAGATAACA GTATGCGTAT TTGCGCGCTG 
2221 ATTTTTGCGG TATAAGAATA TATACTGATA TGTATACCCG AAGTATGTCA AAAAGAGGTG 
2281 TGCTATGAAG CAGCGTATTA CAGTGACAGT TGACAGCGAC AGCTATCAGT TGCTCAAGGC 
2341 ATATATGATG TCAATATCTC CGGTCTGGTA AGCACAACCA TGCAGAATGA AGCCCGTCGT 
2401 CTGCGTGCCG AACGCTGGAA AGCGGAAAAT CAGGAAGGGA TGGCTGAGGT CGCCCGGTTT 
2461 ATTGAAATGA ACGGCTCTTT TGCTGACGAG AACAGGGACT GGTGAAATGC AGTTTAAGGT 
2521 TTACACCTAT AAAAGAGAGA GCCGTTATCG TCTGTTTGTG GATGTACAGA GTGATATTAT 
2581 TGACACGCCC GGGCGACGGA TGGTGATCCC CCTGGCCAGT GCACGTCTGC TGTCAGATAA 
2641 AGTCTCCCGT GAACTTTACC CGGTGGTGCA TATCGGGGAT GAAAGCTGGC GCATGATGAC 
2701 CACCGATATG GCCAGTGTGC CGGTCTCCGT TATCGGGGAA GAAGTGGCTG ATCTCAGCCA 
2761 CCGCGAAAAT GACATCAAAA ACGCCATTAA CCTGATGTTC TGGGGAATAT AAATGTCAGG 
2821 CTCCCTTATA CACAGCCAGT CTGCAGGTCG ACCATAGTGA CTGGATATGT TGTGTTTTAC 
2881 AGTATTATGT AGTCTGTTTT TTATGCAAAA TCTAATTTAA TATATTGATA TTTATATCAT 
2941 TTTACGTTTC TCGTTCAGCT TTCTTGTACA AAGTGGTTTG ATAGCTTGTC GAGAAGTACT 
3001 AGAGGATCAT AATCAGCCAT ACCACATTTG TAGAGGTTTT ACTTGCTTTA AAAAACCTCC 
3061 CACACCTCCC CCTGAACCTG AAACATAAAA TGAATGCAAT TGTTGTTGTT AACTTGTTTA 
3121 TTGCAGCTTA TAATGGTTAC AAATAAAGCA ATAGCATCAC AAATTTCACA AATAAAGCAT 
3181 TTTTTTCACT GCATTCTAGT TGTGGTTTGT CCAAACTCAT CAATGTATCT TATCATGTCT 
3241 GGATCTGATC ACTGCTTGAG CCTAGGAGAT CCGAACCAGA TAAGTGAAAT CTAGTTCCAA 
3301 ACTATTTTGT CATTTTTAAT TTTCGTATTA GCTTACGACG CTACACCCAG TTCCCATCTA 
3361 TTTTGTCACT CTTCCCTAAA TAATCCTTAA AAACTCCATT TCCACCCCTC CCAGTTCCCA 
3421 ACTATTTTGT CCGCCCACAG CGGGGCATTT TTCTTCCTGT TATGTTTTTA ATCAAACATC 
3481 CTGCCAACTC CATGTGACAA ACCGTCATCT TCGGCTACTT TTTCTCTGTC ACAGAATGAA 
3541 AATTTTTCTG TCATCTCTTC GTTATTAATG TTTGTAATTG ACTGAATATC AACGCTTATT 
3601 TGCAGCCTGA ATGGCGAATG GACGCGCCCT GTAGCGGCGC ATTAAGCGCG GCGGGTGTGG 
3661 TGGTTACGCG CAGCGTGACC GCTACACTTG CCAGCGCCCT AGCGCCCGCT CCTTTCGCTT 
3721 TCTTCCCTTC CTTTCTCGCC ACGTTCGCCG GCTTTCCCCG TCAAGCTCTA AATCGGGGGC 
3781 TCCCTTTAGG GTTCCGATTT AGTGCTTTAC GGCACCTCGA CCCCAAAAAA CTTGATTAGG 
3841 GTGATGGTTC ACGTAGTGGG CCATCGCCCT GATAGACGGT TTTTCGCCCT TTGACGTTGG 
3901 AGTCCACGTT CTTTAATAGT GGACTCTTGT TCCAAACTGG AACAACACTC AACCCTATCT 
3961 CGGTCTATTC TTTTGATTTA TAAGGGATTT TGCCGATTTC GGCCTATTGG TTAAAAAATG 
4021 AGCTGATTTA ACAAAAATTT AACGCGAATT TTAACAAAAT ATTAACGTTT ACAATTTCAG 
4081 GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT 
4141 CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA 
4201 GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT 
4261 GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT 
4321 TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT 
4381 TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG 
4441 TATTATCCCG TATTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA 
4501 ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA 
4561 GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA 
4621 CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA 
4681 CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGCGTGACA 
4741 CCACGATGCC TGTAGCAATG GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA 
4801 CTCTAGCTTC CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC 
4861 TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC 
4921 GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG 
4981 TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA 
5041 TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT 
5101 AGATTGATTT AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA 
5161 ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG 
5221 AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA 
5281 CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT 
5341 TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT CTAGTGTAGC 
5401 CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA 
5461 TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA 
5521 GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC 
5581 CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CATTGAGAAA 
5641 GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA 
5701 CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG 
5761 GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC 
5821 TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG 
5881 CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG 
5941 AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG 
6001 AAGCGGAAGA GCGCCTGATG CGGTATTTTC TCCTTACGCA TCTGTGCGGT ATTTCACACC 
6061 GCAGACCAGC CGCGTAACCT GGCAAAATCG GTTACGGTTG AGTAATAAAT GGATGCCCTG 
6121 CGTAAGCGGG TGTGGGCGGA CAATAAAGTC TTAAACTGAA CAAAATAGAT CTAAACTATG 
6181 ACAATAAAGT CTTAAACTAG ACAGAATAGT TGTAAACTGA AATCAGTCCA GTTATGCTGT 
6241 GAAAAAGCAT ACTGGACTTT TGTTATGGCT AAAGCAAACT CTTCATTTTC TGAAGTGCAA 
6301 ATTGCCCGTC GTATTAAAGA GGGGCGTGGC CAAGGGCATG GTAAAGACTA TATTCGCGGC 
6361 GTTGTGACAA TTTACCGAAC AACTCCGCGG CCGGGAAGCC GATCTCGGCT TGAACGAATT 
6421 GTTAGGTGGC GGTACTTGGG TCGATATCAA AGTGCATCAC TTCTTCCCGT ATGCCCAACT 
6481 TTGTATAGAG AGCCACTGCG GGATCGTCAC CGTAATCTGC TTGCACGTAG ATCACATAAG 
6541 CACCAAGCGC GTTGGCCTCA TGCTTGAGGA GATTGATGAG CGCGGTGGCA ATGCCCTGCC 
6601 TCCGGTGCTC GCCGGAGACT GCGAGATCAT AGATATAGAT CTCACTACGC GGCTGCTCAA 
6661 ACCTGGGCAG AACGTAAGCC GCGAGAGCGC CAACAACCGC TTCTTGGTCG AAGGCAGCAA 
6721 GCGCGATGAA TGTCTTACTA CGGAGCAAGT TCCCGAGGTA ATCGGAGTCC GGCTGATGTT 
6781 GGGAGTAGGT GGCTACGTCT CCGAACTCAC GACCGAAAAG ATCAAGAGCA GCCCGCATGG 
6841 ATTTGACTTG GTCAGGGCCG AGCCTACATG TGCGAATGAT GCCCATACTT GAGCCACCTA 
6901 ACTTTGTTTT AGGGCGACTG CCCTGCTGCG TAACATCGTT GCTGCTGCGT AACATCGTTG 
6961 CTGCTCCATA ACATCAAACA TCGACCCACG GCGTAACGCG CTTGCTGCTT GGATGCCCGA 
7021 GGCATAGACT GTACAAAAAA ACAGTCATAA CAAGCCATGA AAACCG 
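
Each line of the listing begins with the position of its first base, so the labels can be used to check a transcription for dropped or extra characters. The following is a minimal Python sketch and not part of the original listing; the helper name `parse_listing` and its `first` parameter are illustrative assumptions. It is applied to the last two lines of the pDEST20 listing, which should end at base 7066.

```python
import re

def parse_listing(lines, first=1):
    """Join numbered listing lines into one string.

    Hypothetical helper: checks that every position label equals the
    running base count and that only A, C, G, T appear in the groups.
    """
    bases = []
    expected = first
    for line in lines:
        label, *groups = line.split()
        chunk = "".join(groups).upper()
        if int(label) != expected:
            raise ValueError(f"line {label}: expected position {expected}")
        if not re.fullmatch(r"[ACGT]+", chunk):
            raise ValueError(f"line {label}: non-ACGT character in sequence")
        bases.append(chunk)
        expected += len(chunk)
    return "".join(bases)

# Last two lines of the pDEST20 listing, copied from the text above.
tail = parse_listing(
    [
        "6961 CTGCTCCATA ACATCAAACA TCGACCCACG GCGTAACGCG CTTGCTGCTT GGATGCCCGA",
        "7021 GGCATAGACT GTACAAAAAA ACAGTCATAA CAAGCCATGA AAACCG",
    ],
    first=6961,
)
last_base = 6961 + len(tail) - 1
print(last_base)  # 7066, the stated size of pDEST20
```

A line that still carries scanning damage (for example a run of `I` and `1` characters in place of T) fails the ACGT check, which makes such lines easy to locate.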
[Figure: annotated double-stranded sequence of the pDEST21 fusion junction. Labels: START TRANSLN, Gal4-DB, attR1.] 

[Figure: circular restriction map of pDEST21. Sites as labeled: Pac I 40, Bsg I 795, Hind III 837, Acc III 1901, Nco I 2206, Xba I 2352, BssH II 2399, BbvC I 2629, Srf I 2773, Sma I 2774, Xma I 2774, Sal I 3033, Aat II 3186, Ecl136 II 3192, Sac I 3192, Fsp I 3863, Dra III 4114, BstE II 5092, Cla I 5209, Xcm I 5608, Tth111 I 6619, Eco72 I 7381, Nru I 7713, Sgf I 8054. Two further positions, 1443 and 7356, have no legible enzyme name.] 
pDEST21 11713 bp (rotated to position 11000) 

Location (Base Nos.)    Gene Encoded 
857..1322               GAL4DB 
1456..1332              attR1 
1706..2365              CmR 
2485..2569              inactivated ccdA 
2707..3012              ccdB 
3053..3177              attR2 
3716..3735              pT7 (T7 promoter) 
3899..4354              f1 (f1 intergenic region) 
4414..6642              Leu2 
7541..8515              kanR 
9668..10958             CYH2 
11118..848              pADH (ADH promoter) 
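
Two kinds of entry in this table list the larger coordinate first: attR1 (1456..1332) and pADH (11118..848). On a circular map such an entry can mean either a feature on the complementary strand or a feature that runs through the origin. The Python sketch below is an illustration only; the function name `feature_span` and the rule of choosing the shorter of the two readings are assumptions, not something stated in the source.

```python
def feature_span(start, end, plasmid_len):
    """Return (reading, length_bp) for a 'start..end' entry on a circular map.

    Assumption (hypothetical rule): when start > end, the entry is read as
    complementary-strand or origin-crossing, whichever gives the shorter span.
    """
    if start <= end:
        return "forward", end - start + 1
    complement_len = start - end + 1
    wrap_len = (plasmid_len - start + 1) + end
    if complement_len <= wrap_len:
        return "complement", complement_len
    return "crosses origin", wrap_len

PDEST21_LEN = 11713  # size given in the table heading

print(feature_span(857, 1322, PDEST21_LEN))    # ('forward', 466)
print(feature_span(1456, 1332, PDEST21_LEN))   # ('complement', 125)
print(feature_span(11118, 848, PDEST21_LEN))   # ('crosses origin', 1444)
```

Under this reading attR1 spans 125 bp, the same length as the attR2 entry (3053..3177), and the ADH promoter entry runs from base 11118 through the end of the numbering to base 848.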



1 TTTATTATGT TACAATATGG AAGGGAACTT TACACTTCTC CTATGCACAT ATATTAATTA 
61 AAGTCCAATG CTAGTAGAGA AGGGGGGTAA CAGCCCTCCG CGCTCTTTTC CGATTTTTTT 
121 CTAAACCGTG GAATATTTCG GATATCCTTT TGTTGTTTCC GGGTGTACAA TATGGACTTC 
181 CTCTTTTCTG GCAACCAAAC CCATACATCG GGATTCCTAT AATACCTTCG TTGGTCTCCC 
241 TAACATGTAG GTGGCGGAGG GGAGATATAC AATAGAACAG ATACCAGACA AGACATAATG 
301 GGCTAAACAA GACTACACCA ATTACACTGC CTCATTGATG GTGGTACATA ACGAACTAAT 
361 ACTGTAGCCC TAGACTTGAT AGCCATCATC ATATCGAAGT TTCACTACCC TTTTTCCATT 
421 TGCCATCTAT TGAAGTAATA ATAGGCGCAT GCAACTTCTT TTCTTTTTTT TTCTTTTCTC 
481 TCTCCCCCGT TGTTGTCTCA CCATATCCGC AATGACAAAA AAAATGATGG AAGACACTAA 
541 AGGAAAAAAT TAACGACAAA GACAGCACCA ACAGATGTCG TTGTTCCAGA GCTGATGAGG 
601 GGTATCTTCG AACACACGAA ACTTTTTCCT TCCTTCATTC ACGCACACTA CTCTCTAATG 
661 AGCAACGGTA TACGGCCTTC CTTCCAGTTA CTTGAATTTG AAATAAAAAA AGTTTGCCGC 
721 TTTGCTATCA AGTATAAATA GACCTGCAAT TATTAATCTT TTGTTTCCTC GTCATTGTTC 
781 TCGTTCCCTT TCTTCCTTGT TTCTTTTTCT GCACAATATT TCAAGCTATA CCAAGCATAC 
841 AATCAACTCC AAGCTTGAAG CAAGCCTCCT GAAAGATGAA GCTACTGTCT TCTATCGAAC 
901 AAGCATGCGA TATTTGCCGA CTTAAAAAGC TCAAGTGCTC CAAAGAAAAA CCGAAGTGCG 
961 CCAAGTGTCT GAAGAACAAC TGGGAGTGTC GCTACTCTCC CAAAACCAAA AGGTCTCCGC 
1021 TGACTAGGGC ACATCTGACA GAAGTGGAAT CAAGGCTAGA AAGACTGGAA CAGCTATTTC 
1081 TACTGATTTT TCCTCGAGAA GACCTTGACA TGATTTTGAA AATGGATTCT TTACAGGATA 
1141 TAAAAGCATT GTTAACAGGA TTATTTGTAC AAGATAATGT GAATAAAGAT GCCGTCACAG 
1201 ATAGATTGGC TTCAGTGGAG ACTGATATGC CTCTAACATT GAGACAGCAT AGAATAAGTG 
1261 CGACATCATC ATCGGAAGAG AGTAGTAACA AAGGTCAAAG ACAGTTGACT GTATCGTCGA 
1321 GGTCGAATCA AACAAGTTTG TACAAAAAAG CTGAACGAGA AACGTAAAAT GATATAAATA 
1381 TCAATATATT AAATTAGATT TTGCATAAAA AACAGACTAC ATAATACTGT AAAACACAAC 
1441 ATATCCAGTC ACTATGGCGG CCGCTAAGTT GGCAGCATCA CCCGACGCAC TTTGCGCCGA 
1501 ATAAATACCT GTGACGGAAG ATCACTTCGC AGAATAAATA AATCCTGGTG TCCCTGTTGA 
1561 TACCGGGAAG CCCTGGGCCA ACTTTTGGCG AAAATGAGAC GTTGATCGGC ACGTAAGAGG 
1621 TTCCAACTTT CACCATAATG AAATAAGATC ACTACCGGGC GTATTTTTTG AGTTATCGAG 
1681 ATTTTCAGGA GCTAAGGAAG CTAAAATGGA GAAAAAAATC ACTGGATATA CCACCGTTGA 
1741 TATATCCCAA TGGCATCGTA AAGAACATTT TGAGGCATTT CAGTCAGTTG CTCAATGTAC 
1801 CTATAACCAG ACCGTTCAGC TGGATATTAC GGCCTTTTTA AAGACCGTAA AGAAAAATAA 
1861 GCACAAGTTT TATCCGGCCT TTATTCACAT TCTTGCCCGC CTGATGAATG CTCATCCGGA 
1921 ATTCCGTATG GCAATGAAAG ACGGTGAGCT GGTGATATGG GATAGTGTTC ACCCTTGTTA 
1981 CACCGTTTTC CATGAGCAAA CTGAAACGTT TTCATCGCTC TGGAGTGAAT ACCACGACGA 
2041 TTTCCGGCAG TTTCTACACA TATATTCGCA AGATGTGGCG TGTTACGGTG AAAACCTGGC 
2101 CTATTTCCCT AAAGGGTTTA TTGAGAATAT GTTTTTCGTC TCAGCCAATC CCTGGGTGAG 
2161 TTTCACCAGT TTTGATTTAA ACGTGGCCAA TATGGACAAC TTCTTCGCCC CCGTTTTCAC 
2221 CATGGGCAAA TATTATACGC AAGGCGACAA GGTGCTGATG CCGCTGGCGA TTCAGGTTCA 
2281 TCATGCCGTC TGTGATGGCT TCCATGTCGG CAGAATGCTT AATGAATTAC AACAGTACTG 
2341 CGATGAGTGG CAGGGCGGGG CGTAATCTAG AGGATCCGGC TTACTAAAAG CCAGATAACA 
2401 GTATGCGTAT TTGCGCGCTG ATTTTTGCGG TATAAGAATA TATACTGATA TGTATACCCG 
2461 AAGTATGTCA AAAAGAGGTG TGCTATGAAG CAGCGTATTA CAGTGACAGT TGACAGCGAC 
2521 AGCTATCAGT TGCTCAAGGC ATATATGATG TCAATATCTC CGGTCTGGTA AGCACAACCA 
2581 TGCAGAATGA AGCCCGTCGT CTGCGTGCCG AACGCTGGAA AGCGGAAAAT CAGGAAGGGA 
2641 TGGCTGAGGT CGCCCGGTTT ATTGAAATGA ACGGCTCTTT TGCTGACGAG AACAGGGACT 
2701 GGTGAAATGC AGTTTAAGGT TTACACCTAT AAAAGAGAGA GCCGTTATCG TCTGTTTGTG 
2761 GATGTACAGA GTGATATTAT TGACACGCCC GGGCGACGGA TGGTGATCCC CCTGGCCAGT 
2821 GCACGTCTGC TGTCAGATAA AGTCTCCCGT GAACTTTACC CGGTGGTGCA TATCGGGGAT 
2881 GAAAGCTGGC GCATGATGAC CACCGATATG GCCAGTGTGC CGGTCTCCGT TATCGGGGAA 
2941 GAAGTGGCTG ATCTCAGCCA CCGCGAAAAT GACATCAAAA ACGCCATTAA CCTGATGTTC 
3001 TGGGGAATAT AAATGTCAGG CTCCCTTATA CACAGCCAGT CTGCAGGTCG ACCATAGTGA 
3061 CTGGATATGT TGTGTTTTAC AGTATTATGT AGTCTGTTTT TTATGCAAAA TCTAATTTAA 
3121 TATATTGATA TTTATATCAT TTTACGTTTC TCGTTCAGCT TTCTTGTACA AAGTGGTTTG 
3181 ATGGCCGCTA AGTAAGTAAG ACGTCGAGCT CTAAGTAAGT AACGGCCGCC ACCGCGGTGG 
3241 AGCTTTGGAC TTCTTCGCCA GAGGTTTGGT CAAGTCTCCA ATCAAGGTTG TCGGCTTGTC 
3301 TACCTTGCCA GAAATTTACG AAAAGATGGA AAAGGGTCAA ATCGTTGGTA GATACGTTGT 
3361 TGACACTTCT AAATAAGCGA ATTTCTTATG ATTTATGATT TTTATTATTA AATAAGTTAT 
3421 AAAAAAAATA AGTGTATACA AATTTTAAAG TGACTCTTAG GTTTTAAAAC GAAAATTCTT 
3481 ATTCTTGAGT AACTCTTTCC TGTAGGTCAG GTTGCTTTCT CAGGTATAGC ATGAGGTCGC 
3541 TCTTATTGAC CACACCTCTA CCGGCATGCC GAGCAAATGC CTGCAAATCG CTCCCCATTT 
3601 CACCCAATTG TAGATATGCT AACTCCAGCA ATGAGTTGAT GAATCTCGGT GTGTATTTTA 
3661 TGTCCTCAGA GGACAATACC TGTTGTAATC GTTCTTCCAC ACGGATCCCA ATTCGCCCTA 
3721 TAGTGAGTCG TATTACAATT CACTGGCCGT CGTTTTACAA CGTCGTGACT GGGAAAACCC 
3781 TGGCGTTACC CAACTTAATC GCCTTGCAGC ACATCCCCCT TTCGCCAGCT GGCGTAATAG 
3841 CGAAGAGGCC CGCACCGATC GCCCTTCCCA ACAGTTGCGC AGCCTGAATG GCGAATGGAC 
3901 GCGCCCTGTA GCGGCGCATT AAGCGCGGCG GGTGTGGTGG TTACGCGCAG CGTGACCGCT 
3961 ACACTTGCCA GCGCCCTAGC GCCCGCTCCT TTCGCTTTCT TCCCTTCCTT TCTCGCCACG 
4021 TTCGCCGGCT TTCCCCGTCA AGCTCTAAAT CGGGGGCTCC CTTTAGGGTT CCGATTTAGT 
4081 GCTTTACGGC ACCTCGACCC CAAAAAACTT GATTAGGGTG ATGGTTCACG TAGTGGGCCA 
4141 TCGCCCTGAT AGACGGTTTT TCGCCCTTTG ACGTTGGAGT CCACGTTCTT TAATAGTGGA 
4201 CTCTTGTTCC AAACTGGAAC AACACTCAAC CCTATCTCGG TCTATTCTTT TGATTTATAA 
4261 GGGATTTTGC CGATTTCGGC CTATTGGTTA AAAAATGAGC TGATTTAACA AAAATTTAAC 
4321 GCGAATTTTA ACAAAATATT AACGTTTACA ATTTCCTGAT GCGGTATTTT CTCCTTACGC 
4381 ATCTGTGCGG TATTTCACAC CGCATATCGA CCGGTCGAGG AGAACTTCTA GTATATCCAC 
4441 ATACCTAATA TTATTGCCTT ATTAAAAATG GAATCGGAAC AATTACATCA AAATCCACAT 
4501 TCTCTTCAAA ATCAATTGTC CTGTACTTCC TTGTTCATGT GTGTTCAAAA ACGTTATATT 
4561 TATAGGATAA TTATACTCTA TTTCTCAACA AGTAATTGGT TGTTTGGCCG AGCGGTCTAA 
4621 GGCGCCTGAT TCAAGAAATA TCTTGACCGC AGTTAACTGT GGGAATACTC AGGTATCGTA 
4681 AGATGCAAGA GTTCGAATCT CTTAGCAACC ATTATTTTTT TCCTCAACAT AACGAGAACA 
4741 CACAGGGGCG CTATCGCACA GAATCAAATT CGATGACTGG AAATTTTTTG TTAATTTCAG 
4801 AGGTCGCCTG ACGCATATAC CTTTTTCAAC TGAAAAATTG GGAGAAAAAG GAAAGGTGAG 
4861 AGGCCGGAAC CGGCTTTTCA TATAGAATAG AGAAGCGTTC ATGACTAAAT GCTTGCATCA 
4921 CAATACTTGA AGTTGACAAT ATTATTTAAG GACCTATTGT TTTTTCCAAT AGGTGGTTAG 
4981 CAATCGTCTT ACTTTCTAAC TTTTCTTACC TTTTACATTT CAGCAATATA TATATATATT 
5041 TCAAGGATAT ACCATTCTAA TGTCTGCCCC TATGTCTGCC CCTAAGAAGA TCGTCGTTTT 
5101 GCCAGGTGAC CACGTTGGTC AAGAAATCAC AGCCGAAGCC ATTAAGGTTC TTAAAGCTAT 
5161 TTCTGATGTT CGTTCCAATG TCAAGTTCGA TTTCGAAAAT CATTTAATTG GTGGTGCTGC 
5221 TATCGATGCT ACAGGTGTCC CACTTCCAGA TGAGGCGCTG GAAGCCTCCA AGAAGGTTGA 
5281 TGCCGTTTTG TTAGGTGCTG TGGGTGGTCC TAAATGGGGT ACCGGTAGTG TTAGACCTGA 
5341 ACAAGGTTTA CTAAAAATCC GTAAAGAACT TCAATTGTAC GCCAACTTAA GACCATGTAA 
5401 CTTTGCATCC GACTCTCTTT TAGACTTATC TCCAATCAAG CCACAATTTG CTAAAGGTAC 
5461 TGACTTCGTT GTTGTCAGAG AATTAGTGGG AGGTATTTAC TTTGGTAAGA GAAAGGAAGA 
5521 CGATGGTGAT GGTGTCGCTT GGGATAGTGA ACAATACACC GTTCCAGAAG TGCAAAGAAT 
5581 CACAAGAATG GCCGCTTTCA TGGCCCTACA ACATGAGCCA CCATTGCCTA TTTGGTCCTT 
5641 GGATAAAGCT AATGTTTTGG CCTCTTCAAG ATTATGGAGA AAAACTGTGG AGGAAACCAT 
5701 CAAGAACGAA TTCCCTACAT TGAAGGTTCA ACATCAATTG ATTGATTCTG CCGCCATGAT 
5761 CCTAGTTAAG AACCCAACCC ACCTAAATGG TATTATAATC ACCAGCAACA TGTTTGGTGA 
5821 TATCATCTCC GATGAAGCCT CCGTTATCCC AGGTTCCTTG GGTTTGTTGC CATCTGCGTC 
5881 CTTGGCCTCT TTGCCAGACA AGAACACGGC ATTTGGTTTG TACGAACCAT GCCACGGTTC 
5941 TGCTCCAGAT TTGCCAAAGA ATAAGGTTGA CCCTATCGCC ACTATCTTGT CTGCTGCAAT 
6001 GATGTTGAAA TTGTCATTGA ACTTGCCTGA AGAAGGTAAG GCCATTGAAG ATGCAGTTAA 
6061 AAAGGTTTTG GATGCAGGTA TCAGAACTGG TGATTTAGGT GGTTCCAACA GTACCACCGA 
6121 AGTCGGTGAT GCTGTCGCCG AAGAAGTTAA GAAAATCCTT GCTTAAAAAG ATTCTCTTTT 
6181 TTTATGATAT TTGTACATAA ACTTTATAAA TGAAATTCAT AATAGAAACG ACACGAAATT 
6241 ACAAAATGGA ATATGTTCAT AGGGTAGACG AAACTATATA CGCAATCTAC ATACATTTAT 
6301 CAAGAAGGAG AAAAAGGAGG ATAGTAAAGG AATACAGGTA AGCAAATTGA TACTAATGGC 
6361 TCAACGTGAT AAGGAAAAAG AATTGCACTT TAACATTAAT ATTGACAAGG AGGAGGGCAC 
6421 CACACAAAAA GTTAGGTGTA ACAGAAAATC ATGAAACTAC GATTCCTAAT TTGATATTGG 
6481 AGGATTTTCT CTAAAAAAAA AAAAATACAA CAAATAAAAA ACACTCAATG ACCTGACCAT 
6541 TTGATGGAGT TTAAGTCAAT ACCTTCTTGA ACCATTTCCC ATAATGGTGA AAGTTCCCTC 
6601 AAGAATTTTA CTCTGTCAGA AACGGCCTTA CGACGTAGTC GATATGGTGC ACTCTCAGTA 
6661 CAATCTGCTC TGATGCCGCA TAGTTAAGCC AGCCCCGACA CCCGCCAACA CCCGCTGACG 
6721 CGCCCTGACG GGCTTGTCTG CTCCCGGCAT CCGCTTACAG ACAAGCTGTG ACCGTCTCCG 
6781 GGAGCTGCAT GTGTCAGAGG TTTTCACCGT CATCACCGAA ACGCGCGAGA CGAAAGGGCC 
6841 TCGTGATACG CCTATTTTTA TAGGTTAATG TCATGATAAT AATGGTTTCT TAGGACGGAT 
6901 CGCTTGCCTG TAACTTACAC GCGCCTCGTA TCTTTTAATG ATGGAATAAT TTGGGAATTT 
6961 ACTCTGTGTT TATTTATTTT TATGTTTTGT ATTTGGATTT TAGAAAGTAA ATAAAGAAGG 
7021 TAGAAGAGTT ACGGAATGAA GAAAAAAAAA TAAACAAAGG TTTAAAAAAT TTCAACAAAA 
7081 AGCGTACTTT ACATATATAT TTATTAGACA AGAAAAGCAG ATTAAATAGA TATACATTCG 
7141 ATTAACGATA AGTAAAATGT AAAATCACAG GATTTTCGTG TGTGGTCTTC TACACAGACA 
7201 AGATGAAACA ATTCGGCATT AATACCTGAG AGCAGGAAGA GCAAGATAAA AGGTAGTATT 
7261 TGTTGGCGAT CCCCCTAGAG TCTTTTACAT CTTCGGAAAA CAAAAACTAT TTTTTCTTTA 
7321 ATTTCTTTTT TTACTTTCTA TTTTTAATTT ATATATTTAT ATTAAAAAAT TTAAATTATA 
7381 ATTATTTTTA TAGCACGTGA TGAAAAGGAC CCAGGTGGCA CTTTTCGGGG AAATGTGCGC 
7441 GGAACCCCTA TTTGTTTATT TTTCTAAATA CATTCAAATA TGTATCCGCT CATGAGACAA 
7501 TAACCCTGAT AAATGCTTCA ATAATCTGCA GCTCTGGCCC GTGTCTCAAA ATCTCTGATG 
7561 TTACATTGCA CAAGATAAAA ATATATCATC ATGAACAATA AAACTGTCTG CTTACATAAA 
7621 CAGTAATACA AGGGGTGTTA TGAGCCATAT TCAACGGGAA ACGTCTTGCT GGAGGCCGCG 
7681 ATTAAATTCC AACATGGATG CTGATTTATA TGGGTATAAA TGGGCTCGCG ATAATGTCGG 
7741 GCAATCAGGT GCGACAATCT TTCGATTGTA TGGGAAGCCC GATGCGCCAG AGTTGTTTCT 
7801 GAAACATGGC AAAGGTAGCG TTGCCAATGA TGTTACAGAT GAGATGGTCA GACTAAACTG 
7861 GCTGACGGAA TTTATGCCTC TTCCGACCAT CAAGCATTTT ATCCGTACTC CTGATGATGC 
7921 ATGGTTACTC ACCACTGCGA TCCGCGGGAA AACAGCATTC CAGGTATTAG AAGAATATCC 
7981 TGATTCAGGT GAAAATATTG TTGATGCGCT GGCAGTGTTC CTGCGCCGGT TGCATTCGAT 
8041 TCCTGTTTGT AATTGTCCTT TTAACAGCGA TCGCGTATTT CGTCTCGCTC AGGCGCAATC 
8101 ACGAATGAAT AACGGTTTGG TTGATGCGAG TGATTTTGAT GACGAGCGTA ATGGCTGGCC 
8161 TGTTGAACAA GTCTGGAAAG AAATGCATAC GCTTTTGCCA TTCTCACCGG ATTCAGTCGT 
8221 CACTCATGGT GATTTCTCAC TTGATAACCT TATTTTTGAC GAGGGGAAAT TAATAGGTTG 
8281 TATTGATGTT GGACGAGTCG GAATCGCAGA CCGATACCAG GATCTTGCCA TCCTATGGAA 
8341 CTGCCTCGGT GAGTTTTCTC CTTCATTACA GAAACGGCTT TTTCAAAAAT ATGGTATTGA 
8401 TAATCCTGAT ATGAATAAAT TGCAGTTTCA TTTGATGCTC GATGAGTTTT TCTAATCAGA 
8461 ATTGGTTAAT TGGTTGTAAC ACTGGCAGAG CATTACGCTG ACTTGACGGG ACGGCGCATG 
8521 ACCAAAATCC CTTAACGTGA GTTTTCGTTC CACTGAGCGT CAGACCCCGT AGAAAAGATC 
8581 AAAGGATCTT CTTGAGATCC TTTTTTTCTG CGCGTAATCT GCTGCTTGCA AACAAAAAAA 
8641 CCACCGCTAC CAGCGGTGGT TTGTTTGCCG GATCAAGAGC TACCAACTCT TTTTCCGAAG 
8701 GTAACTGGCT TCAGCAGAGC GCAGATACCA AATACTGTCC TTCTAGTGTA GCCGTAGTTA 
8761 GGCCACCACT TCAAGAACTC TGTAGCACCG CCTACATACC TCGCTCTGCT AATCCTGTTA 
8821 CCAGTGGCTG CTGCCAGTGG CGATAAGTCG TGTCTTACCG GGTTGGACTC AAGACGATAG 
8881 TTACCGGATA AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT CGTGCACACA GCCCAGCTTG 
8941 GAGCGAACGA CCTACACCGA ACTGAGATAC CTACAGCGTG AGCATTGAGA AAGCGCCACG 
9001 CTTCCCGAAG GGAGAAAGGC GGACAGGTAT CCGGTAAGCG GCAGGGTCGG AACAGGAGAG 
9061 CGCACGAGGG AGCTTCCAGG GGGGAACGCC TGGTATCTTT ATAGTCCTGT CGGGTTTCGC 
9121 CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG GGGGGCCGAG CCTATGGAAA 
9181 AACGCCAGCA ACGCGGCCTT TTTACGGTTC CTGGCCTTTT GCTGGCCTTT TGCTCACATG 
9241 TTCTTTCCTG CGTTATCCCC TGATTCTGTG GATAACCGTA TTACCGCCTT TGAGTGAGCT 
9301 GATACCGCTC GCCGCAGCCG AACGACCGAG CGCAGCGAGT CAGTGAGCGA GGAAGCGGAA 
9361 GAGCGCCCAA TACGCAAACC GCCTCTCCCC GCGCGTTGGC CGATTCATTA ATGCAGCTGG 
9421 CACGACAGGT TTCCCGACTG GAAAGCGGGC AGTGAGCGCA ACGCAATTAA TGTGAGTTAC 
9481 CTCACTCATT AGGCACCCCA GGCTTTAGAC TTTATGCTTC CGGCTCCTAT GTTGTGTGGA 
9541 ATTGTGAGCG GATAACAATT TCACACAGGA AACAGCTATG ACCATGATTA CGCCAAGCTC 
9601 GGAATTAACC CTCACTAAAG GGAACAAAAG CTGGTACCGA TCCCGAGCTT TGCAAATTAA 
9661 AGCCTTCGAG CGTCCCAAAA CCTTCTCAAG CAAGGTTTTC AGTATAATGT TACATGCGTA 
9721 CACGCGTCTG TACAGAAAAA AAAGAAAAAT TTGAAATATA AATAACGTTC TTAATACTAA 
9781 CATAACTATA AAAAAATAAA TAGGGACCTA GACTTCAGGT TGTCTAACTC CTTCCTTTTC 
9841 GGTTAGAGCG GATGTGGGGG GAGGGCGTGA ATGTAAGCGT GACATAACTA ATTACATGAT 
9901 ATCGACAAAG GAAAAGGGGC CTGTTTACTC ACAGGCTTTT TTCAAGTAGG TAATTAAGTC 
9961 GTTTCTGTCT TTTTCCTTCT TCAACCCACC AAAGGCCATC TTGGTACTTT TTTTTTTTTT 
10021 TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT 
10081 TTTTTTTTTT TTTTTTTTTT TCATAGAAAT AATACAGAAG TAGATGTTGA ATTAGATTAA 
10141 ACTGAAGATA TATAATTTAT TGGAAAATAC ATAGAGCTTT TTGTTGATGC GCTTAAGCGA 
10201 TCAATTCAAC AACACCACCA GCAGCTCTGA TTTTTTCTTC AGCCAACTTG GAGACGAATC 
10261 TAGCTTTGAC GATAACTGGA ACATTTGGAA TTCTACCCTT ACCCAAGATC TTACCGTAAC 
10321 CGGCTGCCAA AGTGTCAATA ACTGGAGCAG TTTCCTTAGA AGCAGATTTC AAGTATTGGT 
10381 CTCTCTTGTC TTCTGGGATC AATGTCCACA ATTTGTCCAA GTTCAAGACT GGCTTCCAGA 
10441 AATGAGCTTG TTGCTTGTGG AAGTATCTCA TACCAACCTT ACCGAAATAA CCTGGATGGT 
10501 ATTTATCCAT GTTAATTCTG TGGTGATGTT GACCACCGGC CATACCTCTA CCACCGGGGT 
10561 GCTTTCTGTG CTTACCGATA CGACCTTTAC CGGCTGAGAC GTGACCTCTG TGCTTTCTAG 
10621 TCTTAGTGAA TCTGGAAGGC ATTCTTGATT AGTTGGATGA TTGTTCTGGG ATTTAATGCA 
10681 AAAATCACTT AAGAAGGAAA ATCAACGGAG AAAGCAAACG CCATCTTAAA TATACGGGAT 
10741 ACAGATGAAA GGGTTTGAAC CTATCTGGAA AATAGCATTA AACAAGCGAA AAACTGCGAG 
10801 GAAAATTGTT TGCGTCTCTG CGGGCTATTC ACGCGCCAGA GGAAAATAGG AAAAATAACA 
10861 GGGCATTAGA AAAATAATTT TGATTTTGGT AATGTGTGGG TCCTGGTGTA CAGATGTTAC 
10921 ATTGGTTACA GTACTCTTGT TTTTGCTGTG TTTTTCGATG AATCTCCAAA ATGGTTG™ 
10981 GCACATGGAA GAGTCACCGA TGCTAAGTTA TCTCTATGTA AGCTACGTGG CGTGACTTTT 
11041 GATGAAGCCG CACAAGAGAT ACAGGATTGG CAACTGCAAA TAGAATCTGG GGATCCCCCC 
11101 TCGAGATCCG GGATCGAAGA AATGATGGTA AATGAAATAG GAAATCAAGG AGCATGAAGG 
11161 CAAAAGACAA ATATAAGGGT CGAACGAAAA ATAAAGTGAA AAGTGTTGAT ATGATGTATT 
11221 TGGCTTTGCG GCGCCGAAAA AACGAGTTTA CGCAATTGCA CAATCATGCT GACTCTGTGG 
11281 CGGACCCGCG CTCTTGCCGG CCCGGCGATA ACGCTGGGCG TGAGGCTGTG CCCGGCGGAG 
11341 TTTTTTGCGC CTGCATTTTC CAAGGTTTAC CCTGCGCTAA GGGGCGAGAT TGGAGAAGCA 
11401 ATAAGAATGC CGGTTGGGGT TGCGATGATG ACGACCACGA CAACTGGTGT CATTATTTAA 
11461 GTTGCCGAAA GAACCTGAGT GCATTTGCAA CATGAGTATA CTAGAAGAAT GAGCCAAGAC 
11521 TTGCGAGACG CGAGTTTGCC GGTGGTGCGA ACAATAGAGC GACCATGACC TTGAAGGTGA 
11581 GACGCGCATA ACCGCTAGAG TACTTTGAAG AGGAAACAGC AATAGGGTTG CTACCAGTAT 
11641 AAATAGACAG GTACATACAA CACTGGAAAT GGTTGTCTGT TTGAGTACGC TTTCAATTCA 
11701 TTTGGGTGTG CAC 
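
The feature table for pDEST21 places attR1 at 1456..1332, and the start of the attR1 sequence (5'-ACAAGTTTGTACAAAAAAGCTGAAC...) can be located directly in the listing. The Python sketch below is illustrative only; `find_position` and `reverse_complement` are hypothetical helper names, and the 25-base motif is the opening of the attR1 sequence given in the recombination site list.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]

def find_position(listing_line, motif):
    """Return the 1-based plasmid position of motif within one listing line.

    The line is expected to look like 'LABEL GROUP GROUP ...', where LABEL
    is the position of the first base. Returns None if the motif is absent.
    """
    label, *groups = listing_line.split()
    offset = "".join(groups).find(motif)
    return None if offset < 0 else int(label) + offset

# Line 1321 of the pDEST21 listing, copied from the text above.
line_1321 = "1321 GGTCGAATCA AACAAGTTTG TACAAAAAAG CTGAACGAGA AACGTAAAAT GATATAAATA"
attr1_start = "ACAAGTTTGTACAAAAAAGCTGAAC"

print(find_position(line_1321, attr1_start))  # 1332
print(reverse_complement(attr1_start))
```

The motif begins at base 1332, which agrees with the lower coordinate of the attR1 entry in the table. The reverse complement is what a search would need to match if the listing were read from the opposite strand.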
2-HYBRID VECTOR WITH ACTIVATION DOMAIN 

[Figure: annotated double-stranded sequence of the pDEST22 fusion junction. Labels: ADH PROMOTER, START TRANSLATION, Gal4-AD, attR1, Int.] 

[Figure: circular restriction map of pDEST22. Sites as labeled: Pac I 71, Ahd I 7021, Kpn I 8312, Acc65 I 8312, Apa I 8318, Bsp120 I 8318.] 
pDEST22 8923 bp 

Location (Base Nos.) 
904..1248 
1388..1264 
1638..2297 
2417..2501 
2639..2944 
2985..3109 
3831..4318 
4334..5176 
6110..7194 
8344..866 

Gene Encoded 
GAL4 AD 
attR1 
CmR 
inactivated ccdA 
ccdB 
attR2 
TRP1 
ampR 
pADH (yeast ADH promoter) 
f1 (f1 intergenic region) 

1 TTCATTTGGG TGTGCACTTT ATTATGTTAC AATATGGAAG GGAACTTTAC ACTTCTCCTA 
61 TGCACATATA TTAATTAAAG TCCAATGCTA GTAGAGAAGG GGGGTAACAC CCCTCCGCGC 
121 TCTTTTCCGA TTTTTTTCTA AACCGTGGAA TATTTCGGAT ATCCTTTTGT TGTTTCCGGG 
181 TGTACAATAT GGACTTCCTC TTTTCTGGCA ACCAAACCCA TACATCGGGA TTCCTATAAT 
241 ACCTTCGTTG GTCTCCCTAA CATGTAGGTG GCGGAGGGGA GATATACAAT AGAACAGATA 
301 CCAGACAAGA CATAATGGGC TAAACAAGAC TACACCAATT ACACTGCCTC ATTGATGGTG 
361 GTACATAACG AACTAATACT GTAGCCCTAG ACTTGATAGC CATCATCATA TCGAAGTTTC 
421 ACTACCCTTT TTCCATTTGC CATCTATTGA AGTAATAATA GGCGCATGCA ACTTCTTTTC 
481 TTTTTTTTTC TTTTCTCTCT CCCCCGTTGT TGTCTCACCA TATCCGCAAT GACAAAAAAA
541 ATGATGGAAG ACACTAAAGG AAAAAATTAA CGACAAAGAC AGCACCAACA GATGTCGTTG 
601 TTCCAGAGCT GATGAGGGGT ATCTTCGAAC ACACGAAACT TTTTCCTTCC TTCATTCACG 
661 CACACTACTC TCTAATGAGC AACGGTATAC GGCCTTCCTT CCAGTTACTT GAATTTGAAA 
721 TAAAAAAAGT TTGCCGCTTT GCTATCAAGT ATAAATAGAC CTGCAATTAT TAATCTTTTG 
781 TTTCCTCGTC ATTGTTCTCG TTCCCTTTCT TCCTTGTTTC TTTTTCTGCA CAATATTTCA 
841 AGCTATACCA AGCATACAAT CAACTCCAAG CTTATGCCCA AGAAGAAGCG GAAGGTCTCG 
901 AGCGGCGCCA ATTTTAATCA AAGTGGGAAT ATTGCTGATA GCTCATTGTC CTTCACTTTC 
961 ACTAACAGTA GCAACGGTCC GAACCTCATA ACAACTCAAA CAAATTCTCA AGCGCTTTCA 
1021 CAACCAATTG CCTCCTCTAA CGTTCATGAT AACTTCATGA ATAATGAAAT CACGGCTAGT 
1081 AAAATTGATG ATGGTAATAA TTCAAAACCA CTGTCACCTG GTTGGACGGA CCAAACTGCG 
1141 TATAACGCGT TTGGAATCAC TACAGGGATG TTTAATACCA CTACAATGGA TGATGTATAT 
1201 AACTATCTAT TCGATGATGA AGATACCCCA CCAAACCCAA AAAAAGAGGG TGGGTCGAAT 
1261 CAAACAAGTT TGTACAAAAA AGCTGAACGA GAAACGTAAA ATGATATAAA TATCAATATA 
1321 TTAAATTAGA TTTTGCATAA AAAACAGACT ACATAATACT GTAAAACACA ACATATCCAG 
1381 TCACTATGGC GGCCGCTAAG TTGGCAGCAT CACCCGACGC ACTTTGCGCC GAATAAATAC 
1441 CTGTGACGGA AGATCACTTC GCAGAATAAA TAAATCCTGG TGTCCCTGTT GATACCGGGA 
1501 AGCCCTGGGC CAACTTTTGG CGAAAATGAG ACGTTGATCG GCACGTAAGA GGTTCCAACT 
1561 TTCACCATAA TGAAATAAGA TCACTACCGG GCGTATTTTT TGAGTTATCG AGATTTTCAG 
1621 GAGCTAAGGA AGCTAAAATG GAGAAAAAAA TCACTGGATA TACCACCGTT GATATATCCC 
1681 AATGGCATCG TAAAGAACAT TTTGAGGCAT TTCAGTCAGT TGCTCAATGT ACCTATAACC 
1741 AGACCGTTCA GCTGGATATT ACGGCCTTTT TAAAGACCGT AAAGAAAAAT AAGCACAAGT 
1801 TTTATCCGGC CTTTATTCAC ATTCTTGCCC GCCTGATGAA TGCTCATCCG GAATTCCGTA 
1861 TGGCAATGAA AGACGGTGAG CTGGTGATAT GGGATAGTGT TCACCCTTGT TACACCGTTT
1921 TCCATGAGCA AACTGAAACG TTTTCATCGC TCTGGAGTGA ATACCACGAC GATTTCCGGC 
1981 AGTTTCTACA CATATATTCG CAAGATGTGG CGTGTTACGG TGAAAACCTG GCCTATTTCC 
2041 CTAAAGGGTT TATTGAGAAT ATGTTTTTCG TCTCAGCCAA TCCCTGGGTG AGTTTCACCA
2101 GTTTTGATTT AAACGTGGCC AATATGGACA ACTTCTTCGC CCCCGTTTTC ACCATGGGCA
2161 AATATTATAC GCAAGGCGAC AAGGTGCTGA TGCCGCTGGC GATTCAGGTT CATCATGCCG 
2221 TCTGTGATGG CTTCCATGTC GGCAGAATGC TTAATGAATT ACAACAGTAC TGCGATGAGT 
2281 GGCAGGGCGG GGCGTAATCT AGAGGATCCG GCTTACTAAA AGCCAGATAA CAGTATGCGT
2341 ATTTGCGCGC TGATTTTTGC GGTATAAGAA TATATACTGA TATGTATACC CGAAGTATGT 
2401 CAAAAAGAGG TGTGCTATGA AGCAGCGTAT TACAGTGACA GTTGACAGCG ACAGCTATCA 
2461 GTTGCTCAAG GCATATATGA TGTCAATATC TCCGGTCTGG TAAGCACAAC CATGCAGAAT 
2521 GAAGCCCGTC GTCTGCGTGC CGAACGCTGG AAAGCGGAAA ATCAGGAAGG GATGGCTGAG 
2581 GTCGCCCGGT TTATTGAAAT GAACGGCTCT TTTGCTGACG AGAACAGGGA CTGGTGAAAT
2641 GCAGTTTAAG GTTTACACCT ATAAAAGAGA GAGCCGTTAT CGTCTGTTTG TGGATGTACA 
2701 GAGTGATATT ATTGACACGC CCGGGCGACG GATGGTGATC CCCCTGGCCA GTGCACGTCT 
2761 GCTGTCAGAT AAAGTCTCCC GTGAACTTTA CCCGGTGGTG CATATCGGGG ATGAAAGCTG 
2821 GCGCATGATG ACCACCGATA TGGCCAGTGT GCCGGTCTCC GTTATCGGGG AAGAAGTGGC 
2881 TGATCTCAGC CACCGCGAAA ATGACATCAA AAACGCCATT AACCTGATGT TCTGGGGAAT 
2941 ATAAATGTCA GGCTCCCTTA TACACAGCCA GTCTGCAGGT CGACCATAGT GACTGGATAT 
3001 GTTGTGTTTT ACAGTATTAT GTAGTCTGTT TTTTATGCAA AATCTAATTT AATATATTGA 
3061 TATTTATATC ATTTTACGTT TCTCGTTCAG CTTTCTTGTA CAAAGTGGTT TGATGGCCGC 
3121 TAAGTAAGTA AGACGTCGAG CTCTAAGTAA GTAACGGCCG CCACCGCGGT GGAGCTTTGG 
3181 ACTTCTTCGC CAGAGGTTTG GTCAAGTCTC CAATCAAGGT TGTCGGCTTG TCTACCTTGC 
3241 CAGAAATTTA CGAAAAGATG GAAAAGGGTC AAATCGTTGG TAGATACGTT GTTGACACTT 
3301 CTAAATAAGC GAATTTCTTA TGATTTATGA TTTTTATTAT TAAATAAGTT ATAAAAAAAA
3361 TAAGTGTATA CAAATTTTAA AGTGACTCTT AGGTTTTAAA ACGAAAATTC TTATTCTTGA 
3421 GTAACTCTTT CCTGTAGGTC AGGTTGCTTT CTCAGGTATA GCATGAGGTC GCTCTTATTG 
3481 ACCACACCTC TACCGGCATG CCGAGCAAAT GCCTGCAAAT CGCTCCCCAT TTCACCCAAT 
3541 TGTAGATATG CTAACTCCAG CAATGAGTTG ATGAATCTCG GTGTGTATTT TATGTCCTCA
3601 GAGGACAATA CCTGTTGTAA TCGTTCTTCC ACACGGATCC CAATTCGCCC TATAGTGAGT 
3661 CGTATTACAA TTCACTGGCC GTCGTTTTAC AACGTCGTGA CTGGGAAAAC CCTGGCGTTA 
3721 CCCAACTTAA TCGCCTTGCA GCACATCCCC CTTTCGCCAG CTGGCGTAAT AGCGAAGAGG 
3781 CCCGCACCGA TCGCCCTTCC CAACAGTTGC GCAGCCTGAA TGGCGAATGG ACGCGCCCTG 
3841 TAGCGGCGCA TTAAGCGCGG CGGGTGTGGT GGTTACGCGC AGCGTGACCG CTACACTTGC 
3901 CAGCGCCCTA GCGCCCGCTC CTTTCGCTTT CTTCCCTTCC TTTCTCGCCA CGTTCGCCGG 
3961 CTTTCCCCGT CAAGCTCTAA ATCGGGGGCT CCCTTTAGGG TTCCGATTTA GTGCTTTACG
4021 GCACCTCGAC CCCAAAAAAC TTGATTAGGG TGATGGTTCA CGTAGTGGGC CATCGCCCTG 
4081 ATAGACGGTT TTTCGCCCTT TGACGTTGGA GTCCACGTTC TTTAATAGTG GACTCTTGTT 
4141 CCAAACTGGA ACAACACTCA ACCCTATCTC GGTCTATTCT TTTGATTTAT AAGGGATTTT 
4201 GCCGATTTCG GCCTATTGGT TAAAAAATGA GCTGATTTAA CAAAAATTTA ACGCGAATTT 
4261 TAACAAAATA TTAACGTTTA CAATTTCCTG ATGCGGTATT TTCTCCTTAC GCATCTGTGC 
4321 GGTATTTCAC ACCGCAGGCA AGTGCACAAA CAATACTTAA ATAAATACTA CTCAGTAATA
4381 ACCTATTTCT TAGCATTTTT GACGAAATTT GCTATTTTGT TAGAGTCTTT TACACCATTT 
4441 GTCTCCACAC CTCCGCTTAC ATCAACACCA ATAACGCCAT TTAATCTAAG CGCATCACCA
4501 ACATTTTCTG GCGTCAGTCC ACCAGCTAAC ATAAAATGTA AGCTTTCGGG GCTCTCTTGC
4561 CTTCCAACCC AGTCAGAAAT CGAGTTCCAA TCCAAAAGTT CACCTGTCCC ACCTGCTTCT
4621 GAATCAAACA AGGGAATAAA CGAATGAGGT TTCTGTGAAG CTGCACTGAG TAGTATGTTG
4681 CAGTCTTTTG GAAATACGAG TCTTTTAATA ACTGGCAAAC CGAGGAACTC TTGGTATTCT
4741 TGCCACGACT CATCTCCATG CAGTTGGACG ATATCAATGC CGTAATCATT GACCAGAGCC
4801 AAAACATCCT CCTTAGGTTG ATTACGAAAC ACGCCAACCA AGTATTTCGG AGTGCCTGAA
4861 CTATTTTTAT ATGCTTTTAC AAGACTTGAA ATTTTCCTTG CAATAACCGG GTCAATTGTT
4921 CTCTTTCTAT TGGGCACACA TATAATACCC AGCAAGTCAG CATCGGAATC TAGAGCACAT
4981 TCTGCGGCCT CTGTGCTCTG CAAGCCGCAA ACTTTCACCA ATGGACCAGA ACTACCTGTG
5041 AAATTAATAA CAGACATACT CCAAGCTGCC TTTGTGTGCT TAATCACGTA TACTCACGTG
5101 CTCAATAGTC ACCAATGCCC TCCCTCTTGG CCCTCTCCTT TTCTTTTTTC GACCGAATTA
5161 ATTCTTAATC GGCAAAAAAA GAAAAGCTCC GGATCAAGAT TGTACGTAAG GTGACAAGCT
5221 ATTTTTCAAT AAAGAATATC TTCCACTACT GCCATCTGGC GTCATAACTG CAAAGTACAC
5281 ATATATTACG ATGCTGTCTA TTAAATGCTT CCTATATTAT ATATATAGTA ATGTCGTTTA
5341 TGGTGCACTC TCAGTACAAT CTGCTCTGAT GCCGCATAGT TAAGCCAGCC CCGACACCCG
5401 CCAACACCCG CTGACGCGCC CTGACGGGCT TGTCTGCTCC CGGCATCCGC TTACAGACAA
5461 GCTGTGACCG TCTCCGGGAG CTGCATGTGT CAGAGGTTTT CACCGTCATC ACCGAAACGC
5521 GCGAGACGAA AGGGCCTCGT GATACGCCTA TTTTTATAGG TTAATGTCAT GATAATAATG
5581 GTTTCTTAGG ACGGATCGCT TGCCTGTAAC TTACACGCGC CTCGTATCTT TTAATGATGG
5641 AATAATTTGG GAATTTACTC TGTGTTTATT TATTTTTATG TTTTGTATTT GGATTTTAGA
5701 AAGTAAATAA AGAAGGTAGA AGAGTTACGG AATGAAGAAA AAAAAATAAA CAAAGGTTTA
5761 AAAAATTTCA ACAAAAAGCG TACTTTACAT ATATATTTAT TAGACAAGAA AAGCAGATTA
5821 AATAGATATA CATTCGATTA ACGATAAGTA AAATGTAAAA TCACAGGATT TTCGTGTGTG
5881 GTCTTCTACA CAGACAAGAT GAAACAATTC GGCATTAATA CCTGAGAGCA GGAAGAGCAA
5941 GATAAAAGGT AGTATTTGTT GGCGATCCCC CTAGAGTCTT TTACATCTTC GGAAAACAAA
6001 AACTATTTTT TCTTTAATTT CTTTTTTTAC TTTCTATTTT TAATTTATAT ATTTATATTA
6061 AAAAATTTAA ATTATAATTA TTTTTATAGC ACGTGATGAA AAGGACCCAG GTGGCACTTT
6121 TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA
6181 TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA GGAAGAGTAT
6241 GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT
6301 TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG
6361 AGTGGGTTAC ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA
6421 AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG
6481 TATTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT
6541 TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG
6601 CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA CAACGATCGG
6661 AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA CTCGCCTTGA
6721 TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGCGTGACA CCACGATGCC
6781 TGTAGCAATG GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC
6841 CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC
6901 GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC GTGGGTCTCG
6961 CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG TTATCTACAC



FIG.42D 






7021 GACGGGCAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA TAGGTGCCTC
7081 ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT
7141 AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC 
7201 CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA 
7261 AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC 
7321 ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT 
7381 AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT CTAGTGTAGC CGTAGTTAGG 
7441 CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC 
7501 AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT 
7561 ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA 
7621 GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CATTGAGAAA GCGCCACGCT 
7681 TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG 
7741 CACGAGGGAG CTTCCAGGGG GGAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA 
7801 CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCCGAGCC TATGGAAAAA 
7861 CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT 
7921 CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA 
7981 TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA 
8041 GCGCCCAATA CGCAAACCGC CTCTCCCCGC GCGTTGGCCG ATTCATTAAT GCAGCTGGCA 
8101 CGACAGGTTT CCCGACTGGA AAGCGGGCAG TGAGCGCAAC GCAATTAATG TGAGTTACCT 
8161 CACTCATTAG GCACCCCAGG CTTTACACTT TATGCTTCCG GCTCCTATGT TGTGTGGAAT 
8221 TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATGAC CATGATTACG CCAAGCTCGG 
8281 AATTAACCCT CACTAAAGGG AACAAAAGCT GGGTACCGGG CCCCCCCTCG AGATCCGGGA 
8341 TCGAAGAAAT GATGGTAAAT GAAATAGGAA ATCAAGGAGC ATGAAGGCAA AAGACAAATA 
8401 TAAGGGTCGA ACGAAAAATA AAGTGAAAAG TGTTGATATG ATGTATTTGG CTTTGCGGCG 
8461 CCGAAAAAAC GAGTTTACGC AATTGCACAA TCATGCTGAC TCTGTGGCGG ACCCGCGCTC 
8521 TTGCCGGCCC GGCGATAACG CTGGGCGTGA GGCTGTGCCC GGCGGAGTTT TTTGCGCCTG
8581 CATTTTCCAA GGTTTACCCT GCGCTAAGGG GCGAGATTGG AGAAGCAATA AGAATGCCGG 
8641 TTGGGGTTGC GATGATGACG ACCACGACAA CTGGTGTCAT TATTTAAGTT GCCGAAAGAA 
8701 CCTGAGTGCA TTTGCAACAT GAGTATACTA GAAGAATGAG CCAAGACTTG CGAGACGCGA 
8761 GTTTGCCGGT GGTGCGAACA ATAGAGCGAC CATGACCTTG AAGGTGAGAC GCGCATAACC 
8821 GCTAGAGTAC TTTGAAGAGG AAACAGCAAT AGGGTTGCTA CCAGTATAAA TAGACAGGTA 
8881 CATACAACAC TGGAAATGGT TGTCTGTTTG AGTACGCTTT CAA 
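A listing in this layout (a leading position number followed by groups of ten bases) can be joined into one string and checked against the stated size of 8923 bp. The sketch below is illustrative only; the name `parse_listing` is hypothetical. It verifies that each line's position number agrees with the running base count and rejects any character other than A, C, G or T, which is a quick way to catch scanning damage.

```python
def parse_listing(lines, first_position=1):
    """Join numbered sequence lines into one uppercase DNA string."""
    bases = []
    count = first_position - 1
    for line in lines:
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        position, groups = int(fields[0]), fields[1:]
        if position != count + 1:
            raise ValueError(f"expected position {count + 1}, found {position}")
        chunk = "".join(groups).upper()
        bad = set(chunk) - set("ACGT")
        if bad:
            raise ValueError(f"line {position}: unexpected characters {sorted(bad)}")
        bases.append(chunk)
        count += len(chunk)
    return "".join(bases)


# Last two lines of the pDEST22 listing.
TAIL = [
    "8821 GCTAGAGTAC TTTGAAGAGG AAACAGCAAT AGGGTTGCTA CCAGTATAAA TAGACAGGTA",
    "8881 CATACAACAC TGGAAATGGT TGTCTGTTTG AGTACGCTTT CAA",
]

if __name__ == "__main__":
    tail = parse_listing(TAIL, first_position=8821)
    print(8820 + len(tail))  # position of the final base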
pDEST23

His6 CARBOXY-FUSION VECTOR, T7 PROMOTER

      T7 PROMOTER                         mRNA
 205  atc ccg cga aat taa tac gac tca cta ta|g gga gac cac aac ggt ttc cct
      tag ggc gct tta att atg ctg agt gat atc cc|t ctg gtg ttg cca aag gga

                      Int   attR1
 256  cta gat cac aag ttt |gta caa aaa agc tga acg aga aac gta aaa tga tat
      gat cta gtg ttc aaa cat gtt t|tt tcg act tgc tct ttg cat ttt act ata

      CmR, ccdB

1888  ttt tta tgc aaa atc taa ttt aat ata ttg ata ttt ata tca ttt tac gtt
      aaa aat acg ttt tag att aaa tta tat aac tat aaa tat agt aaa atg caa

      attR2        A   F    L   Y   K   V   V   I   M   S   Y   Y   H   H
1939  tct cgt tca gct tt|c ttg tac aaa gtg gtg att atg tcg tac tac cat cac
      aga gca agt cga aag aac atg |ttt cac cac taa tac agc atg atg gta gtg

      His6
       H   H   H   H   L   D   E   Q  TERM
1990  cat cac cat cac ctc gat gag caa taa cta gca taa ccc ctt ggg gcc tct
      gta gtg gta gtg gag cta ctc gtt att gat cgt att ggg gaa ccc cgg aga

Restriction sites recovered from the plasmid map: NruI, BstAPI, SphI, EcoNI, PshAI, SgrAI
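The amino acid letters printed above the codons in this figure (I M S Y Y H H ... L D E Q TERM) can be reproduced by translating the top strand with the standard genetic code. The sketch below is an illustration, not part of the figure: the helper name `translate` is hypothetical, the codon string is read from the attR2/His6 junction with the stop codon taken from the sequence listing, and spaces and the "|" cut marks are simply ignored.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string in frame 1; stop codons are written as '*'."""
    dna = dna.upper().replace(" ", "").replace("|", "")
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Codons downstream of attR2 in pDEST23, running into the His6 tag.
HIS6_JUNCTION = "att atg tcg tac tac cat cac cat cac cat cac ctc gat gag caa taa"

if __name__ == "__main__":
    print(translate(HIS6_JUNCTION))
```

The six consecutive histidine codons (cat/cac) correspond to the His6 label in the figure, and the trailing `*` corresponds to TERM.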
pDEST23 6264 bp

Location (Base Nos.)    Gene Encoded
285..161                attR1
394..1053               CmR
1173..1257              inactivated ccdA
1395..1700              ccdB
1741..1865              attR2
1883..1911              his6
2574..3434              ampR
3583..4222              ori



1 TCTTCCCCAT CGGTGATGTC GGCGATATAG GCGCCAGCAA CCGCACCTGT GGCGCCGGTG 
61 ATGCCGGCCA CGATGCGTCC GGCGTAGAGG ATCGAGATCT CGATCCCGCG AAATTAATAC 
121 GACTCACTAT AGGGAGACCA CAACGGTTTC CCTCTAGATC ACAAGTTTGT ACAAAAAAGC 
181 TGAACGAGAA ACGTAAAATG ATATAAATAT CAATATATTA AATTAGATTT TGCATAAAAA 
241 ACAGACTACA TAATACTGTA AAACACAACA TATCCAGTCA CTATGGCGGC CGCATTAGGC 
301 ACCCCAGGCT TTACACTTTA TGCTTCCGGC TCGTATAATG TGTGGATTTT GAGTTAGGAT 
361 CCGGCGAGAT TTTCAGGAGC TAAGGAAGCT AAAATGGAGA AAAAAATCAC TGGATATACC 
421 ACCGTTGATA TATCCCAATG GCATCGTAAA GAACATTTTG AGGCATTTCA GTCAGTTGCT 
481 CAATGTACCT ATAACCAGAC CGTTCAGCTG GATATTACGG CCTTTTTAAA GACCGTAAAG 
541 AAAAATAAGC ACAAGTTTTA TCCGGCCTTT ATTCACATTC TTGCCCGCCT GATGAATGCT 
601 CATCCGGAAT TCCGTATGGC AATGAAAGAC GGTGAGCTGG TGATATGGGA TAGTGTTCAC
661 CCTTGTTACA CCGTTTTCCA TGAGCAAACT GAAACGTTTT CATCGCTCTG GAGTGAATAC 
721 CACGACGATT TCCGGCAGTT TCTACACATA TATTCGCAAG ATGTGGCGTG TTACGGTGAA 
781 AACCTGGCCT ATTTCCCTAA AGGGTTTATT GAGAATATGT TTTTCGTCTC AGCCAATCCC 
841 TGGGTGAGTT TCACCAGTTT TGATTTAAAC GTGGCCAATA TGGACAACTT CTTCGCCCCC
901 GTTTTCACCA TGGGCAAATA TTATACGCAA GGCGACAAGG TGCTGATGCC GCTGGCGATT 
961 CAGGTTCATC ATGCCGTCTG TGATGGCTTC CATGTCGGCA GAATGCTTAA TGAATTACAA 
1021 CAGTACTGCG ATGAGTGGCA GGGCGGGGCG TAAACGCGTG GATCCGGCTT ACTAAAAGCC 
1081 AGATAACAGT ATGCGTATTT GCGCGCTGAT TTTTGCGGTA TAAGAATATA TACTGATATG 
1141 TATACCCGAA GTATGTCAAA AAGAGGTGTG CTATGAAGCA GCGTATTACA GTGACAGTTG 
1201 ACAGCGACAG CTATCAGTTG CTCAAGGCAT ATATGATGTC AATATCTCCG GTCTGGTAAG 
1261 CACAACCATG CAGAATGAAG CCCGTCGTCT GCGTGCCGAA CGCTGGAAAG CGGAAAATCA 
1321 GGAAGGGATG GCTGAGGTCG CCCGGTTTAT TGAAATGAAC GGCTCTTTTG CTGACGAGAA 
1381 CAGGGACTGG TGAAATGCAG TTTAAGGTTT ACACCTATAA AAGAGAGAGC CGTTATCGTC 
1441 TGTTTGTGGA TGTACAGAGT GATATTATTG ACACGCCCGG GCGACGGATG GTGATCCCCC 
1501 TGGCCAGTGC ACGTCTGCTG TCAGATAAAG TCTCCCGTGA ACTTTACCCG GTGGTGCATA 
1561 TCGGGGATGA AAGCTGGCGC ATGATGACCA CCGATATGGC CAGTGTGCCG GTCTCCGTTA 
1621 TCGGGGAAGA AGTGGCTGAT CTCAGCCACC GCGAAAATGA CATCAAAAAC GCCATTAACC 
1681 TGATGTTCTG GGGAATATAA ATGTCAGGCT CCCTTATACA CAGCCAGTCT GCAGGTCGAC 
1741 CATAGTGACT GGATATGTTG TGTTTTACAG TATTATGTAG TCTGTTTTTT ATGCAAAATC
1801 TAATTTAATA TATTGATATT TATATCATTT TACGTTTCTC GTTCAGCTTT CTTGTACAAA
1861 GTGGTGATTA TGTCGTACTA CCATCACCAT CACCATCACC TCGATGAGCA ATAACTAGCA
1921 TAACCCCTTG GGGCCTCTAA ACGGGTCTTG AGGGGTTTTT TGCTGAAAGG AGGAACTATA
1981 TCCGGATATC CACAGGACGG GTGTGGTCGC CATGATCGCG TAGTCGATAG TGGCTCCAAG
2041 TAGCGAAGCG AGCAGGACTG GGCGGCGGCC AAAGCGGTCG GACAGTGCTC CGAGAACGGG
2101 TGCGCATAGA AATTGCATCA ACGCATATAG CGCTAGCAGC ACGCCATAGT GACTGGCGAT
2161 GCTGTCGGAA TGGACGATAT CCCGCAAGAG GCCCGGCAGT ACCGGCATAA CCAAGCCTAT
2221 GCCTACAGCA TCCAGGGTGA CGGTGCCGAG GATGACGATG AGCGCATTGT TAGATTTCAT
2281 ACACGGTGCC TGACTGCGTT AGCAATTTAA CTGTGATAAA CTACCGCATT AAAGCTTATC
2341 GATGATAAGC TGTCAAACAT GAGAATTCTT GAAGACGAAA GGGCCTCGTG ATACGCCTAT
2401 TTTTATAGGT TAATGTCATG ATAATAATGG TTTCTTAGAC GTCAGGTGGC ACTTTTCGGG
2461 GAAATGTGCG CGGAACCCCT ATTTGTTTAT TTTTCTAAAT ACATTCAAAT ATGTATCCGC
2521 TCATGAGACA ATAACCCTGA TAAATGCTTC AATAATATTG AAAAAGGAAG AGTATGAGTA
2581 TTCAACATTT CCGTGTCGCC CTTATTCCCT TTTTTGCGGC ATTTTGCCTT CCTGTTTTTG
2641 CTCACCCAGA AACGCTGGTG AAAGTAAAAG ATGCTGAAGA TCAGTTGGGT GCACGAGTGG
2701 GTTACATCGA ACTGGATCTC AACAGCGGTA AGATCCTTGA GAGTTTTCGC CCCGAAGAAC
2761 GTTTTCCAAT GATGAGCACT TTTAAAGTTC TGCTATGTGG CGCGGTATTA TCCCGTGTTG
2821 ACGCCGGGCA AGAGCAACTC GGTCGCCGCA TACACTATTC TCAGAATGAC TTGGTTGAGT
2881 ACTCACCAGT CACAGAAAAG CATCTTACGG ATGGCATGAC AGTAAGAGAA TTATGCAGTG
2941 CTGCCATAAC CATGAGTGAT AACACTGCGG CCAACTTACT TCTGACAACG ATCGGAGGAC
3001 CGAAGGAGCT AACCGCTTTT TTGCACAACA TGGGGGATCA TGTAACTCGC CTTGATCGTT
3061 GGGAACCGGA GCTGAATGAA GCCATACCAA ACGACGAGCG TGACACCACG ATGCCTGCAG
3121 CAATGGCAAC AACGTTGCGC AAACTATTAA CTGGCGAACT ACTTACTCTA GCTTCCCGGC
3181 AACAATTAAT AGACTGGATG GAGGCGGATA AAGTTGCAGG ACCACTTCTG CGCTCGGCCC
3241 TTCCGGCTGG CTGGTTTATT GCTGATAAAT CTGGAGCCGG TGAGCGTGGG TCTCGCGGTA
3301 TCATTGCAGC ACTGGGGCCA GATGGTAAGC CCTCCCGTAT CGTAGTTATC TACACGACGG
3361 GGAGTCAGGC AACTATGGAT GAACGAAATA GACAGATCGC TGAGATAGGT GCCTCACTGA
3421 TTAAGCATTG GTAACTGTCA GACCAAGTTT ACTCATATAT ACTTTAGATT GATTTAAAAC
3481 TTCATTTTTA ATTTAAAAGG ATCTAGGTGA AGATCCTTTT TGATAATCTC ATGACCAAAA
3541 TCCCTTAACG TGAGTTTTCG TTCCACTGAG CGTCAGACCC CGTAGAAAAG ATCAAAGGAT
3601 CTTCTTGAGA TCCTTTTTTT CTGCGCGTAA TCTGCTGCTT GCAAACAAAA AAACCACCGC
3661 TACCAGCGGT GGTTTGTTTG CCGGATCAAG AGCTACCAAC TCTTTTTCCG AAGGTAACTG
3721 GCTTCAGCAG AGCGCAGATA CCAAATACTG TCCTTCTAGT GTAGCCGTAG TTAGGCCACC
3781 ACTTCAAGAA CTCTGTAGCA CCGCCTACAT ACCTCGCTCT GCTAATCCTG TTACCAGTGG
3841 CTGCTGCCAG TGGCGATAAG TCGTGTCTTA CCGGGTTGGA CTCAAGACGA TAGTTACCGG
3901 ATAAGGCGCA GCGGTCGGGC TGAACGGGGG GTTCGTGCAC ACAGCCCAGC TTGGAGCGAA
3961 CGACCTACAC CGAACTGAGA TACCTACAGC GTGAGCTATG AGAAAGCGCC ACGCTTCCCG
4021 AAGGGAGAAA GGCGGACAGG TATCCGGTAA GCGGCAGGGT CGGAACAGGA GAGCGCACGA



FIG.43C
4081 GGGAGCTTCC AGGGGGAAAC GCCTGGTATC TTTATAGTCC TGTCGGGTTT CGCCACCTCT
4141 GACTTGAGCG TCGATTTTTG TGATGCTCGT CAGGGGGGCG GAGCCTATGG AAAAACGCCA 
4201 GCAACGCGGC CTTTTTACGG TTCCTGGCCT TTTGCTGGCC TTTTGCTCAC ATGTTCTTTC 
4261 CTGCGTTATC CCCTGATTCT GTGGATAACC GTATTACCGC CTTTGAGTGA GCTGATACCG 
4321 CTCGCCGCAG CCGAACGACC GAGCGCAGCG AGTCAGTGAG CGAGGAAGCG GAAGAGCGCC 
4381 TGATGCGGTA TTTTCTCCTT ACGCATCTGT GCGGTATTTC ACACCGCATA TATGGTGCAC 
4441 TCTCAGTACA ATCTGCTCTG ATGCCGCATA GTTAAGCCAG TATACACTCC GCTATCGCTA 
4501 CGTGACTGGG TCATGGCTGC GCCCCGACAC CCGCCAACAC CCGCTGACGC GCCCTGACGG 
4561 GCTTGTCTGC TCCCGGCATC CGCTTACAGA CAAGCTGTGA CCGTCTCCGG GAGCTGCATG 
4621 TGTCAGAGGT TTTCACCGTC ATCACCGAAA CGCGCGAGGC AGCTGCGGTA AAGCTCATCA
4681 GCGTGGTCGT GAAGCGATTC ACAGATGTCT GCCTGTTCAT CCGCGTCCAG CTCGTTGAGT 
4741 TTCTCCAGAA GCGTTAATGT CTGGCTTCTG ATAAAGCGGG CCATGTTAAG GGCGGTTTTT 
4801 TCCTGTTTGG TCACTGATGC CTCCGTGTAA GGGGGATTTC TGTTCATGGG GGTAATGATA 
4861 CCGATGAAAC GAGAGAGGAT GCTCACGATA CGGGTTACTG ATGATGAACA TGCCCGGTTA 
4921 CTGGAACGTT GTGAGGGTAA ACAACTGGCG GTATGGATGC GGCGGGACCA GAGAAAAATC 
4981 ACTCAGGGTC AATGCCAGCG CTTCGTTAAT ACAGATGTAG GTGTTCCACA GGGTAGCCAG 
5041 CAGCATCCTG CGATGCAGAT CCGGAACATA ATGGTGCAGG GCGCTGACTT CCGCGTTTCC 
5101 AGACTTTACG AAACACGGAA ACCGAAGACC ATTCATGTTG TTGCTCAGGT CGCAGACGTT 
5161 TTGCAGCAGC AGTCGCTTCA CGTTCGCTCG CGTATCGGTG ATTCATTCTG CTAACCAGTA 
5221 AGGCAACCCC GCCAGCCTAG CCGGGTCCTC AACGACAGGA GCACGATCAT GCGCACCCGT 
5281 GGCCAGGACC CAACGCTGCC CGAGATGCGC CGCGTGCGGC TGCTGGAGAT GGCGGACGCG 
5341 ATGGATATGT TCTGCCAAGG GTTGGTTTGC GCATTCACAG TTCTCCGCAA GAATTGATTG 
5401 GCTCCAATTC TTGGAGTGGT GAATCCGTTA GCGAGGTGCC GCCGGCTTCC ATTCAGGTCG
5461 AGGTGGCCCG GCTCCATGCA CCGCGACGCA ACGCGGGGAG GCAGACAAGG TATAGGGCGG 
5521 CGCCTACAAT CCATGCCAAC CCGTTCCATG TGCTCGCCGA GGCGGCATAA ATCGCCGTGA 
5581 CGATCAGCGG TCCAGTGATC GAAGTTAGGC TGGTAAGAGC CGCGAGCGAT CCTTGAAGCT 
5641 GTCCCTGATG GTCGTCATCT ACCTGCCTGG ACAGCATGGC CTGCAACGCG GGCATCCCGA 
5701 TGCCGCCGGA AGCGAGAAGA ATCATAATGG GGAAGGCCAT CCAGCCTCGC GTCGCGAACG 
5761 CCAGCAAGAC GTAGCCCAGC GCGTCGGCCG CCATGCCGGC GATAATGGCC TGCTTCTCGC 
5821 CGAAACGTTT GGTGGCGGGA CCAGTGACGA AGGCTTGAGC GAGGGCGTGC AAGATTCCGA 
5881 ATACCGCAAG CGACAGGCCG ATCATCGTCG CGCTCCAGCG AAAGCGGTCC TCGCCGAAAA 
5941 TGACCCAGAG CGCTGCCGGC ACCTGTCCTA CGAGTTGCAT GATAAAGAAG ACAGTCATAA 
6001 GTGCGGCGAC GATAGTCATG CCCCGCGCCC ACCGGAAGGA GCTGACTGGG TTGAAGGCTC 
6061 TCAAGGGCAT CGGTCGATCG ACGCTCTCCC TTATGCGACT CCTGCATTAG GAAGCAGCCC 
6121 AGTAGTAGGT TGAGGCCGTT GAGCACCGCC GCCGCAAGGA ATGGTGCATG CAAGGAGATG 
6181 GCGCCCAACA GTCCCCCGGC CACGGGGCCT GCCACCATAC CCACGCCGAA ACAAGCGCTC 
6241 ATGAGCCCGA AGTGGCGAGC CCGA 
      T7 PROMOTER                                         mRNA
   1  atc gag atc tcg atc ccg cga aat taa tac gac tca cta tag gga gac cac
      tag ctc tag agc tag ggc gct tta att atg ctg agt gat at|c cct ctg gtg

                              Int   attR1
  52  aac ggt ttc cct cta gat cac aag ttt |gta caa aaa agc tga acg aga aac
      ttg cca aag gga gat cta gtg ttc aaa cat gtt t|tt tcg act tgc tct ttg

      CmR, ccdB

      attR2                        A   F    L   Y   K   V   V   I   M   S
1735  tca ttt tac gtt tct cgt tca gct tt|c ttg tac aaa gtg gtg att atg tcc
      agt aaa atg caa aga gca agt cga aag aac atg |ttt cac cac taa tac agg

      GST PROTEIN (~223 aa)
           I   L
1786  cct ata cta ggt tat tgg aaa att aag ggc ctt gtg caa ccc act cga ctt
      gga tat gat cca ata acc ttt taa ttc ccg gaa cac gtt ggg tga gct gaa
pDEST24 6961 bp

Location (Base Nos.)    Gene Encoded
195..71                 attR1
304..963                CmR
1083..1167              inactivated ccdA
1305..1610              ccdB
1651..1775              attR2
1783..2451              GST
3181..4041              ampR
4190..4829              ori



1 ATCGAGATCT CGATCCCGCG AAATTAATAC GACTCACTAT AGGGAGACCA CAACGGTTTC 
61 CCTCTAGATC ACAAGTTTGT ACAAAAAAGC TGAACGAGAA ACGTAAAATG ATATAAATAT 
121 CAATATATTA AATTAGATTT TGCATAAAAA ACAGACTACA TAATACTGTA AAACACAACA 
181 TATCCAGTCA CTATGGCGGC CGCATTAGGC ACCCCAGGCT TTACACTTTA TGCTTCCGGC 
241 TCGTATAATG TGTGGATTTT GAGTTAGGAT CCGGCGAGAT TTTCAGGAGC TAAGGAAGCT 
301 AAAATGGAGA AAAAAATCAC TGGATATACC ACCGTTGATA TATCCCAATG GCATCGTAAA 
361 GAACATTTTG AGGCATTTCA GTCAGTTGCT CAATGTACCT ATAACCAGAC CGTTCAGCTG
421 GATATTACGG CCTTTTTAAA GACCGTAAAG AAAAATAAGC ACAAGTTTTA TCCGGCCTTT 
481 ATTCACATTC TTGCCCGCCT GATGAATGCT CATCCGGAAT TCCGTATGGC AATGAAAGAC 
541 GGTGAGCTGG TGATATGGGA TAGTGTTCAC CCTTGTTACA CCGTTTTCCA TGAGCAAACT
601 GAAACGTTTT CATCGCTCTG GAGTGAATAC CACGACGATT TCCGGCAGTT TCTACACATA 
661 TATTCGCAAG ATGTGGCGTG TTACGGTGAA AACCTGGCCT ATTTCCCTAA AGGGTTTATT
721 GAGAATATGT TTTTCGTCTC AGCCAATCCC TGGGTGAGTT TCACCAGTTT TGATTTAAAC 
781 GTGGCCAATA TGGACAACTT CTTCGCCCCC GTTTTCACCA TGGGCAAATA TTATACGCAA 
841 GGCGACAAGG TGCTGATGCC GCTGGCGATT CAGGTTCATC ATGCCGTCTG TGATGGCTTC 
901 CATGTCGGCA GAATGCTTAA TGAATTACAA CAGTACTGCG ATGAGTGGCA GGGCGGGGCG 
961 TAAACGCGTG GATCCGGCTT ACTAAAAGCC AGATAACAGT ATGCGTATTT GCGCGCTGAT
1021 TTTTGCGGTA TAAGAATATA TACTGATATG TATACCCGAA GTATGTCAAA AAGAGGTGTG 
1081 CTATGAAGCA GCGTATTACA GTGACAGTTG ACAGCGACAG CTATCAGTTG CTCAAGGCAT 
1141 ATATGATGTC AATATCTCCG GTCTGGTAAG CACAACCATG CAGAATGAAG CCCGTCGTCT 
1201 GCGTGCCGAA CGCTGGAAAG CGGAAAATCA GGAAGGGATG GCTGAGGTCG CCCGGTTTAT 
1261 TGAAATGAAC GGCTCTTTTG CTGACGAGAA CAGGGACTGG TGAAATGCAG TTTAAGGTTT 
1321 ACACCTATAA AAGAGAGAGC CGTTATCGTC TGTTTGTGGA TGTACAGAGT GATATTATTG 
1381 ACACGCCCGG GCGACGGATG GTGATCCCCC TGGCCAGTGC ACGTCTGCTG TCAGATAAAG 
1441 TCTCCCGTGA ACTTTACCCG GTGGTGCATA TCGGGGATGA AAGCTGGCGC ATGATGACCA 
1501 CCGATATGGC CAGTGTGCCG GTCTCCGTTA TCGGGGAAGA AGTGGCTGAT CTCAGCCACC 
1561 GCGAAAATGA CATCAAAAAC GCCATTAACC TGATGTTCTG GGGAATATAA ATGTCAGGCT 
1621 CCCTTATACA CAGCCAGTCT GCAGGTCGAC CATAGTGACT GGATATGTTG TGTTTTACAG
1681 TATTATGTAG TCTGTTTTTT ATGCAAAATC TAATTTAATA TATTGATATT TATATCATTT
1741 TACGTTTCTC GTTCAGCTTT CTTGTACAAA GTGGTGATTA TGTCCCCTAT ACTAGGTTAT 
1801 TGGAAAATTA AGGGCCTTGT GCAACCCACT CGACTTCTTT TGGAATATCT TGAAGAAAAA 
1861 TATGAAGAGC ATTTGTATGA GCGCGATGAA GGTGATAAAT GGCGAAACAA AAAGTTTGAA 
1921 TTGGGTTTGG AGTTTCCCAA TCTTCCTTAT TATATTGATG GTGATGTTAA ATTAACACAG 
1981 TCTATGGCCA TCATACGTTA TATAGCTGAC AAGCACAACA TGTTGGGTGG TTGTCCAAAA
2041 GAGCGTGCAG AGATTTCAAT GCTTGAAGGA GCGGTTTTGG ATATTAGATA CGGTGTTTCG
2101 AGAATTGCAT ATAGTAAAGA CTTTGAAACT CTCAAAGTTG ATTTTCTTAG CAAGCTACCT
2161 GAAATGCTGA AAATGTTCGA AGATCGTTTA TGTCATAAAA CATATTTAAA TGGTGATCAT
2221 GTAACCCATC CTGACTTCAT GTTGTATGAC GCTCTTGATG TTGTTTTATA CATGGACCCA
2281 ATGTGCCTGG ATGCGTTCCC AAAATTAGTT TGTTTTAAAA AACGTATTGA AGCTATCCCA
2341 CAAATTGATA AGTACTTGAA ATCCAGCAAG TATATAGCAT GGCCTTTGCA GGGCTGGCAA
2401 GCCACGTTTG GTGGTGGCGA CCATCCTCCA AAATCGGATC TGGTTCCGCG TCCATGGGGA
2461 TCCGGCTGCT AACAAAGCCC GAAAGGAAGC TGAGTTGGCT GCTGCCACCG CTGAGCAATA
2521 ACTAGCATAA CCCCTTGGGG CCTCTAAACG GGTCTTGAGG GGTTTTTTGC TGAAAGGAGG
2581 AACTATATCC GGATATCCAC AGGACGGGTG TGGTCGCCAT GATCGCGTAG TCGATAGTGG
2641 CTCCAAGTAG CGAAGCGAGC AGGACTGGGC GGCGGCCAAA GCGGTCGGAC AGTGCTCCGA
2701 GAACGGGTGC GCATAGAAAT TGCATCAACG CATATAGCGC TAGCAGCACG CCATAGTGAC
2761 TGGCGATGCT GTCGGAATGG ACGATATCCC GCAAGAGGCC CGGCAGTACC GGCATAACCA
2821 AGCCTATGCC TACAGCATCC AGGGTGACGG TGCCGAGGAT GACGATGAGC GCATTGTTAG
2881 ATTTCATACA CGGTGCCTGA CTGCGTTAGC AATTTAACTG TGATAAACTA CCGCATTAAA
2941 GCTTATCGAT GATAAGCTGT CAAACATGAG AATTCTTGAA GACGAAAGGG CCTCGTGATA
3001 CGCCTATTTT TATAGGTTAA TGTCATGATA ATAATGGTTT CTTAGACGTC AGGTGGCACT
3061 TTTCGGGGAA ATGTGCGCGG AACCCCTATT TGTTTATTTT TCTAAATACA TTCAAATATG
3121 TATCCGCTCA TGAGACAATA ACCCTGATAA ATGCTTCAAT AATATTGAAA AAGGAAGAGT
3181 ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT TTGCCTTCCT
3241 GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG CTGAAGATCA GTTGGGTGCA
3301 CGAGTGGGTT ACATCGAACT GGATCTCAAC AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC
3361 GAAGAACGTT TTCCAATGAT GAGCACTTTT AAAGTTCTGC TATGTGGCGC GGTATTATCC
3421 CGTGTTGACG CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG
3481 GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT AAGAGAATTA
3541 TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA ACTTACTTCT GACAACGATC
3601 GGAGGACCGA AGGAGCTAAC CGCTTTTTTG CACAACATGG GGGATCATGT AACTCGCCTT
3661 GATCGTTGGG AACCGGAGCT GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG
3721 CCTGCAGCAA TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT
3781 TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC ACTTCTGCGC
3841 TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG GAGCCGGTGA GCGTGGGTCT
3901 CGCGGTATCA TTGCAGCACT GGGGCCAGAT GGTAAGCCCT CCCGTATCGT AGTTATCTAC
3961 ACGACGGGGA GTCAGGCAAC TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC
4021 TCACTGATTA AGCATTGGTA ACTGTCAGAC CAAGTTTACT CATATATACT TTAGATTGAT
4081 TTAAAACTTC ATTTTTAATT TAAAAGGATC TAGGTGAAGA TCCTTTTTGA TAATCTCATG
4141 ACCAAAATCC CTTAACGTGA GTTTTCGTTC CACTGAGCGT CAGACCCCGT AGAAAAGATC
4201 AAAGGATCTT CTTGAGATCC TTTTTTTCTG CGCGTAATCT GCTGCTTGCA AACAAAAAAA
4261 CCACCGCTAC CAGCGGTGGT TTGTTTGCCG GATCAAGAGC TACCAACTCT TTTTCCGAAG
4321 GTAACTGGCT TCAGCAGAGC GCAGATACCA AATACTGTCC TTCTAGTGTA GCCGTAGTTA
4381 GGCCACCACT TCAAGAACTC TGTAGCACCG CCTACATACC TCGCTCTGCT AATCCTGTTA
4441 CCAGTGGCTG CTGCCAGTGG CGATAAGTCG TGTCTTACCG GGTTGGACTC AAGACGATAG
4501 TTACCGGATA AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT CGTGCACACA GCCCAGCTTG
4561 GAGCGAACGA CCTACACCGA ACTGAGATAC CTACAGCGTG AGCTATGAGA AAGCGCCACG
4621 CTTCCCGAAG GGAGAAAGGC GGACAGGTAT CCGGTAAGCG GCAGGGTCGG AACAGGAGAG
4681 CGCACGAGGG AGCTTCCAGG GGGAAACGCC TGGTATCTTT ATAGTCCTGT CGGGTTTCGC
4741 CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG CCTATGGAAA
4801 AACGCCAGCA ACGCGGCCTT TTTACGGTTC CTGGCCTTTT GCTGGCCTTT TGCTCACATG
4861 TTCTTTCCTG CGTTATCCCC TGATTCTGTG GATAACCGTA TTACCGCCTT TGAGTGAGCT
4921 GATACCGCTC GCCGCAGCCG AACGACCGAG CGCAGCGAGT CAGTGAGCGA GGAAGCGGAA
4981 GAGCGCCTGA TGCGGTATTT TCTCCTTACG CATCTGTGCG GTATTTCACA CCGCATATAT
5041 GGTGCACTCT CAGTACAATC TGCTCTGATG CCGCATAGTT AAGCCAGTAT ACACTCCGCT
5101 ATCGCTACGT GACTGGGTCA TGGCTGCGCC CCGACACCCG CCAACACCCG CTGACGCGCC
5161 CTGACGGGCT TGTCTGCTCC CGGCATCCGC TTACAGACAA GCTGTGACCG TCTCCGGGAG
5221 CTGCATGTGT CAGAGGTTTT CACCGTCATC ACCGAAACGC GCGAGGCAGC TGCGGTAAAG
5281 CTCATCAGCG TGGTCGTGAA GCGATTCACA GATGTCTGCC TGTTCATCCG CGTCCAGCTC
5341 GTTGAGTTTC TCCAGAAGCG TTAATGTCTG GCTTCTGATA AAGCGGGCCA TGTTAAGGGC
5401 GGTTTTTTCC TGTTTGGTCA CTGATGCCTC CGTGTAAGGG GGATTTCTGT TCATGGGGGT
5461 AATGATACCG ATGAAACGAG AGAGGATGCT CACGATACGG GTTACTGATG ATGAACATGC
5521 CCGGTTACTG GAACGTTGTG AGGGTAAACA ACTGGCGGTA TGGATGCGGC GGGACCAGAG
5581 AAAAATCACT CAGGGTCAAT GCCAGCGCTT CGTTAATACA GATGTAGGTG TTCCACAGGG
5641 TAGCCAGCAG CATCCTGCGA TGCAGATCCG GAACATAATG GTGCAGGGCG CTGACTTCCG
5701 CGTTTCCAGA CTTTACGAAA CACGGAAACC GAAGACCATT CATGTTGTTG CTCAGGTCGC
5761 AGACGTTTTG CAGCAGCAGT CGCTTCACGT TCGCTCGCGT ATCGGTGATT CATTCTGCTA
5821 ACCAGTAAGG CAACCCCGCC AGCCTAGCCG GGTCCTCAAC GACAGGAGCA CGATCATGCG
5881 CACCCGTGGC CAGGACCCAA CGCTGCCCGA GATGCGCCGC GTGCGGCTGC TGGAGATGGC
5941 GGACGCGATG GATATGTTCT GCCAAGGGTT GGTTTGCGCA TTCACAGTTC TCCGCAAGAA
6001 TTGATTGGCT CCAATTCTTG GAGTGGTGAA TCCGTTAGCG AGGTGCCGCC GGCTTCCATT
6061 CAGGTCGAGG TGGCCCGGCT CCATGCACCG CGACGCAACG CGGGGAGGCA GACAAGGTAT
6121 AGGGCGGCGC CTACAATCCA TGCCAACCCG TTCCATGTGC TCGCCGAGGC GGCATAAATC
6181 GCCGTGACGA TCAGCGGTCC AGTGATCGAA GTTAGGCTGG TAAGAGCCGC GAGCGATCCT
6241 TGAAGCTGTC CCTGATGGTC GTCATCTACC TGCCTGGACA GCATGGCCTG CAACGCGGGC
6301 ATCCCGATGC CGCCGGAAGC GAGAAGAATC ATAATGGGGA AGGCCATCCA GCCTCGCGTC
6361 GCGAACGCCA GCAAGACGTA GCCCAGCGCG TCGGCCGCCA TGCCGGCGAT AATGGCCTGC
6421 TTCTCGCCGA AACGTTTGGT GGCGGGACCA GTGACGAAGG CTTGAGCGAG GGCGTGCAAG
6481 ATTCCGAATA CCGCAAGCGA CAGGCCGATC ATCGTCGCGC TCCAGCGAAA GCGGTCCTCG
6541 CCGAAAATGA CCCAGAGCGC TGCCGGCACC TGTCCTACGA GTTGCATGAT AAAGAAGACA
6601 GTCATAAGTG CGGCGACGAT AGTCATGCCC CGCGCCCACC GGAAGGAGCT GACTGGGTTG
6661 AAGGCTCTCA AGGGCATCGG TCGATCGACG CTCTCCCTTA TGCGACTCCT GCATTAGGAA
6721 GCAGCCCAGT AGTAGGTTGA GGCCGTTGAG CACCGCCGCC GCAAGGAATG GTGCATGCAA
6781 GGAGATGGCG CCCAACAGTC CCCCGGCCAC GGGGCCTGCC ACCATACCCA CGCCGAAACA
6841 AGCGCTCATG AGCCCGAAGT GGCGAGCCCG ATCTTCCCCA TCGGTGATGT CGGCGATATA
6901 GGCGCCAGCA ACCGCACCTG TGGCGCCGGT GATGCCGGCC ACGATGCGTC CGGCGTAGAG
6961 G
THIOREDOXIN CARBOXY-FUSION VECTOR, T7 PROMOTER

      T7 PROMOTER                                 mRNA
   1  gag atc tcg atc ccg cga aat taa tac gac tca cta ta|g gga gac cac aac
      ctc tag agc tag ggc gct tta att atg ctg agt gat atc cc|t ctg gtg ttg

                      Int   attR1
  52  ggt ttc cct cta gat cac aag ttt |gta caa aaa agc tga acg aga aac gta
      cca aag gga gat cta gtg ttc aaa cat gtt t|tt tcg act tgc tct ttg cat

      CmR, ccdB

      attR2                        F    L   Y   K   V   V   I   M   S   D
1735  ttt tac gtt tct cgt tca gct tt|c ttg tac aaa gtg gtg att atg agc gat
      aaa atg caa aga gca agt cga aag aac atg |ttt cac cac taa tac tcg cta

      Trx PROTEIN (~130 aa)
       K   I   I
1786  aaa att att cac ctg act gac gac agt ttt gac acg gat gta ctc aaa gcg
      ttt taa taa gtg gac tga ctg ctg tca aaa ctg tgc cta cat gag ttt cgc
pDEST25 6652 bp

Location (Base Nos.)    Gene Encoded
844..720                attR1
953..1612               CmR
1732..1816              inactivated ccdA
1954..2259              ccdB
2300..2424              attR2
2432..2794              trx



1 CCGGAAGCGA GAAGAATCAT AATGGGGAAG GCCATCCAGC CTCGCGTCGC GAACGCCAGC
61 AAGACGTAGC CCAGCGCGTC GGCCGCCATG CCGGCGATAA TGGCCTGCTT CTCGCCGAAA 
121 CGTTTGGTGG CGGGACCAGT GACGAAGGCT TGAGCGAGGG CGTGCAAGAT TCCGAATACC 
181 GCAAGCGACA GGCCGATCAT CGTCGCGCTC CAGCGAAAGC GGTCCTCGCC GAAAATGACC 
241 CAGAGCGCTG CCGGCACCTG TCCTACGAGT TGCATGATAA AGAAGACAGT CATAAGTGCG 
301 GCGACGATAG TCATGCCCCG CGCCCACCGG AAGGAGCTGA CTGGGTTGAA GGCTCTCAAG 
361 GGCATCGGTC GATCGACGCT CTCCCTTATG CGACTCCTGC ATTAGGAAGC AGCCCAGTAG 
421 TAGGTTGAGG CCGTTGAGCA CCGCCGCCGC AAGGAATGGT GCATGCAAGG AGATGGCGCC 
481 CAACAGTCCC CCGGCCACGG GGCCTGCCAC CATACCCACG CCGAAACAAG CGCTCATGAG 
541 CCCGAAGTGG CGAGCCCGAT CTTCCCCATC GGTGATGTCG GCGATATAGG CGCCAGCAAC 
601 CGCACCTGTG GCGCCGGTGA TGCCGGCCAC GATGCGTCCG GCGTAGAGGA TCGAGATCTC 
661 GATCCCGCGA AATTAATACG ACTCACTATA GGGAGACCAC AACGGTTTCC CTCTAGATCA 
721 CAAGTTTGTA CAAAAAAGCT GAACGAGAAA CGTAAAATGA TATAAATATC AATATATTAA
781 ATTAGATTTT GCATAAAAAA CAGACTACAT AATACTGTAA AACACAACAT ATCCAGTCAC 
841 TATGGCGGCC GCATTAGGCA CCCCAGGCTT TACACTTTAT GCTTCCGGCT CGTATAATGT
901 GTGGATTTTG AGTTAGGATC CGGCGAGATT TTCAGGAGCT AAGGAAGCTA AAATGGAGAA 
961 AAAAATCACT GGATATACCA CCGTTGATAT ATCCCAATGG CATCGTAAAG AACATTTTGA 
1021 GGCATTTCAG TCAGTTGCTC AATGTACCTA TAACCAGACC GTTCAGCTGG ATATTACGGC
1081 CTTTTTAAAG ACCGTAAAGA AAAATAAGCA CAAGTTTTAT CCGGCCTTTA TTCACATTCT
1141 TGCCCGCCTG ATGAATGCTC ATCCGGAATT CCGTATGGCA ATGAAAGACG GTGAGCTGGT 
1201 GATATGGGAT AGTGTTCACC CTTGTTACAC CGTTTTCCAT GAGCAAACTG AAACGTTTTC 
1261 ATCGCTCTGG AGTGAATACC ACGACGATTT CCGGCAGTTT CTACACATAT ATTCGCAAGA 
1321 TGTGGCGTGT TACGGTGAAA ACCTGGCCTA TTTCCCTAAA GGGTTTATTG AGAATATGTT 
1381 TTTCGTCTCA GCCAATCCCT GGGTGAGTTT CACCAGTTTT GATTTAAACG TGGCCAATAT 
1441 GGACAACTTC TTCGCCCCCG TTTTCACCAT GGGCAAATAT TATACGCAAG GCGACAAGGT 
1501 GCTGATGCCG CTGGCGATTC AGGTTCATCA TGCCGTCTGT GATGGCTTCC ATGTCGGCAG 
1561 AATGCTTAAT GAATTACAAC AGTACTGCGA TGAGTGGCAG GGCGGGGCGT AAACGCGTGG 
1621 ATCCGGCTTA CTAAAAGCCA GATAACAGTA TGCGTATTTG CGCGCTGATT TTTGCGGTAT 
1681 AAGAATATAT ACTGATATGT ATACCCGAAG TATGTCAAAA AGAGGTGTGC TATGAAGCAG 
1741 CGTATTACAG TGACAGTTGA CAGCGACAGC TATCAGTTGC TCAAGGCATA TATGATGTCA 
1801 ATATCTCCGG TCTGGTAAGC ACAACCATGC AGAATGAAGC CCGTCGTCTG CGTGCCGAAC
1861 GCTGGAAAGC GGAAAATCAG GAAGGGATGG CTGAGGTCGC CCGGTTTATT GAAATGAACG
1921 GCTCTTTTGC TGACGAGAAC AGGGACTGGT GAAATGCAGT TTAAGGTTTA CACCTATAAA 
1981 AGAGAGAGCC GTTATCGTCT GTTTGTGGAT GTACAGAGTG ATATTATTGA CACGCCCGGG 
2041 CGACGGATGG TGATCCCCCT GGCCAGTGCA CGTCTGCTGT CAGATAAAGT CTCCCGTGAA 
2101 CTTTACCCGG TGGTGCATAT CGGGGATGAA AGCTGGCGCA TGATGACCAC CGATATGGCC
2161 AGTGTGCCGG TCTCCGTTAT CGGGGAAGAA GTGGCTGATC TCAGCCACCG CGAAAATGAC
2221 ATCAAAAACG CCATTAACCT GATGTTCTGG GGAATATAAA TGTCAGGCTC CCTTATACAC
2281 AGCCAGTCTG CAGGTCGACC ATAGTGACTG GATATGTTGT GTTTTACAGT ATTATGTAGT
2341 CTGTTTTTTA TGCAAAATCT AATTTAATAT ATTGATATTT ATATCATTTT ACGTTTCTCG
2401 TTCAGCTTTC TTGTACAAAG TGGTGATTAT GAGCGATAAA ATTATTCACC TGACTGACGA
2461 CAGTTTTGAC ACGGATGTAC TCAAAGCGGA CGGGGCGATC CTCGTCGATT TCTGGGCAGA
2521 GTGGTGCGGT CCGTGCAAAA TGATCGCCCC GATTCTGGAT GAAATCGCTG ACGAATATCA
2581 GGGCAAACTG ACCGTTGCAA AACTGAACAT CGATCAAAAC CCTGGCACTG CGCCGAAATA
2641 TGGCATCCGT GGTATCCCGA CTCTGCTGCT GTTCAAAAAC GGTGAAGTGG CGGCAACCAA
2701 AGTGGGTGCA CTGTCTAAAG GTCAGTTGAA AGAGTTCCTC GACGCTAACC TGGCCGGTTC
2761 TGGTTCTGGT GATGACGATG ACAAGGTACC CGGGGATCGA TCCGGCTGCT AACAAAGCCC
2821 GAAAGGAAGC TGAGTTGGCT GCTGCCACCG CTGAGCAATA ACTAGCATAA CCCCTTGGGG
2881 CCTCTAAACG GGTCTTGAGG GGTTTTTTGC TGAAAGGAGG AACTATATCC GGATATCCAC
2941 AGGACGGGTG TGGTCGCCAT GATCGCGTAG TCGATAGTGG CTCCAAGTAG CGAAGCGAGC
3001 AGGACTGGGC GGCGGCCAAA GCGGTCGGAC AGTGCTCCGA GAACGGGTGC GCATAGAAAT
3061 TGCATCAACG CATATAGCGC TAGCAGCACG CCATAGTGAC TGGCGATGCT GTCGGAATGG
3121 ACGATATCCC GCAAGAGGCC CGGCAGTACC GGCATAACCA AGCCTATGCC TACAGCATCC
3181 AGGGTGACGG TGCCGAGGAT GACGATGAGC GCATTGTTAG ATTTCATACA CGGTGCCTGA
3241 CTGCGTTAGC AATTTAACTG TGATAAACTA CCGCATTAAA GCTTATCGAT GATAAGCTGT
3301 CAAACATGAG AATTCTTGAA GACGAAAGGG CCTCGTGATA CGCCTATTTT TATAGGTTAA
3361 TGTCATGATA ATAATGGTTT CTTAGACGTC AGGTGGCACT TTTCGGGGAA ATGTGCGCGG
3421 AACCCCTATT TGTTTATTTT TCTAAATACA TTCAAATATG TATCCGCTCA TGAGACAATA
3481 ACCCTGATAA ATGCTTCAAT AATATTGAAA AAGGAAGAGT ATGAGTATTC AACATTTCCG
3541 TGTCGCCCTT ATTCCCTTTT TTGCGGCATT TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC
3601 GCTGGTGAAA GTAAAAGATG CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT
3661 GGATCTCAAC AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT
3721 GAGCACTTTT AAAGTTCTGC TATGTGGCGC GGTATTATCC CGTGTTGACG CCGGGCAAGA
3781 GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG GTTGAGTACT CACCAGTCAC
3841 AGAAAAGCAT CTTACGGATG GCATGACAGT AAGAGAATTA TGCAGTGCTG CCATAACCAT
3901 GAGTGATAAC ACTGCGGCCA ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC
3961 CGCTTTTTTG CACAACATGG GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT
4021 GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA TGGCAACAAC
4081 GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT TCCCGGCAAC AATTAATAGA
4141 CTGGATGGAG GCGGATAAAG TTGCAGGACC ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG
4201 GTTTATTGCT GATAAATCTG GAGCCGGTGA GCGTGGGTCT CGCGGTATCA TTGCAGCACT
4261 GGGGCCAGAT GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC
4321 TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA AGCATTGGTA
4381 ACTGTCAGAC CAAGTTTACT CATATATACT TTAGATTGAT TTAAAACTTC ATTTTTAATT
4441 TAAAAGGATC TAGGTGAAGA TCCTTTTTGA TAATCTCATG ACCAAAATCC CTTAACGTGA
4501 GTTTTCGTTC CACTGAGCGT CAGACCCCGT AGAAAAGATC AAAGGATCTT CTTGAGATCC
4561 TTTTTTTCTG CGCGTAATCT GCTGCTTGCA AACAAAAAAA CCACCGCTAC CAGCGGTGGT
4621 TTGTTTGCCG GATCAAGAGC TACCAACTCT TTTTCCGAAG GTAACTGGCT TCAGCAGAGC
4681 GCAGATACCA AATACTGTCC TTCTAGTGTA GCCGTAGTTA GGCCACCACT TCAAGAACTC
4741 TGTAGCACCG CCTACATACC TCGCTCTGCT AATCCTGTTA CCAGTGGCTG CTGCCAGTGG
4801 CGATAAGTCG TGTCTTACCG GGTTGGACTC AAGACGATAG TTACCGGATA AGGCGCAGCG
4861 GTCGGGCTGA ACGGGGGGTT CGTGCACACA GCCCAGCTTG GAGCGAACGA CCTACACCGA
4921 ACTGAGATAC CTACAGCGTG AGCTATGAGA AAGCGCCACG CTTCCCGAAG GGAGAAAGGC
4981 GGACAGGTAT CCGGTAAGCG GCAGGGTCGG AACAGGAGAG CGCACGAGGG AGCTTCCAGG
5041 GGGAAACGCC TGGTATCTTT ATAGTCCTGT CGGGTTTCGC CACCTCTGAC TTGAGCGTCG
5101 ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG CCTATGGAAA AACGCCAGCA ACGCGGCCTT
5161 TTTACGGTTC CTGGCCTTTT GCTGGCCTTT TGCTCACATG TTCTTTCCTG CGTTATCCCC
5221 TGATTCTGTG GATAACCGTA TTACCGCCTT TGAGTGAGCT GATACCGCTC GCCGCAGCCG
5281 AACGACCGAG CGCAGCGAGT CAGTGAGCGA GGAAGCGGAA GAGCGCCTGA TGCGGTATTT
5341 TCTCCTTACG CATCTGTGCG GTATTTCACA CCGCATATAT GGTGCACTCT CAGTACAATC
5401 TGCTCTGATG CCGCATAGTT AAGCCAGTAT ACACTCCGCT ATCGCTACGT GACTGGGTCA
5461 TGGCTGCGCC CCGACACCCG CCAACACCCG CTGACGCGCC CTGACGGGCT TGTCTGCTCC
5521 CGGCATCCGC TTACAGACAA GCTGTGACCG TCTCCGGGAG CTGCATGTGT CAGAGGTTTT
5581 CACCGTCATC ACCGAAACGC GCGAGGCAGC TGCGGTAAAG CTCATCAGCG TGGTCGTGAA
5641 GCGATTCACA GATGTCTGCC TGTTCATCCG CGTCCAGCTC GTTGAGTTTC TCCAGAAGCG
5701 TTAATGTCTG GCTTCTGATA AAGCGGGCCA TGTTAAGGGC GGTTTTTTCC TGTTTGGTCA
5761 CTGATGCCTC CGTGTAAGGG GGATTTCTGT TCATGGGGGT AATGATACCG ATGAAACGAG
5821 AGAGGATGCT CACGATACGG GTTACTGATG ATGAACATGC CCGGTTACTG GAACGTTGTG
5881 AGGGTAAACA ACTGGCGGTA TGGATGCGGC GGGACCAGAG AAAAATCACT CAGGGTCAAT
5941 GCCAGCGCTT CGTTAATACA GATGTAGGTG TTCCACAGGG TAGCCAGCAG CATCCTGCGA
6001 TGCAGATCCG GAACATAATG GTGCAGGGCG CTGACTTCCG CGTTTCCAGA CTTTACGAAA
6061 CACGGAAACC GAAGACCATT CATGTTGTTG CTCAGGTCGC AGACGTTTTG CAGCAGCAGT
6121 CGCTTCACGT TCGCTCGCGT ATCGGTGATT CATTCTGCTA ACCAGTAAGG CAACCCCGCC
6181 AGCCTAGCCG GGTCCTCAAC GACAGGAGCA CGATCATGCG CACCCGTGGC CAGGACCCAA
6241 CGCTGCCCGA GATGCGCCGC GTGCGGCTGC TGGAGATGGC GGACGCGATG GATATGTTCT
6301 GCCAAGGGTT GGTTTGCGCA TTCACAGTTC TCCGCAAGAA TTGATTGGCT CCAATTCTTG
6361 GAGTGGTGAA TCCGTTAGCG AGGTGCCGCC GGCTTCCATT CAGGTCGAGG TGGCCCGGCT
6421 CCATGCACCG CGACGCAACG CGGGGAGGCA GACAAGGTAT AGGGCGGCGC CTACAATCCA
6481 TGCCAACCCG TTCCATGTGC TCGCCGAGGC GGCATAAATC GCCGTGACGA TCAGCGGTCC
6541 AGTGATCGAA GTTAGGCTGG TAAGAGCCGC GAGCGATCCT TGAAGCTGTC CCTGATGGTC
6601 GTCATCTACC TGCCTGGACA GCATGGCCTG CAACGCGGGC ATCCCGATGC CG
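A numbered listing of this kind carries its own consistency checks: each row's leading number should follow on from the bases counted so far, and the final base number should equal the stated plasmid size. The Python sketch below illustrates one way to apply both checks, assuming rows of a leading coordinate followed by space-separated blocks; `parse_listing` is a hypothetical helper name, not something defined in the source.

```python
import re

def parse_listing(text):
    """Join numbered listing rows into one sequence.

    Returns (first_coordinate, sequence). Raises ValueError if a row's leading
    number does not follow on from the previous rows, or if a block contains
    anything other than A, C, G or T.
    """
    first = None
    parts = []
    total = 0
    for row in text.strip().splitlines():
        fields = row.split()
        if not fields:
            continue
        start = int(fields[0])
        if first is None:
            first = start
        elif start != first + total:
            raise ValueError(f"row numbered {start}, expected {first + total}")
        bases = "".join(fields[1:]).upper()
        bad = re.search(r"[^ACGT]", bases)
        if bad:
            raise ValueError(f"row {start}: unexpected character {bad.group()!r}")
        parts.append(bases)
        total += len(bases)
    return first, "".join(parts)

# Final two rows of the pDEST25 listing.
TAIL = """
6541 AGTGATCGAA GTTAGGCTGG TAAGAGCCGC GAGCGATCCT TGAAGCTGTC CCTGATGGTC
6601 GTCATCTACC TGCCTGGACA GCATGGCCTG CAACGCGGGC ATCCCGATGC CG
"""
first, seq = parse_listing(TAIL)
print(first + len(seq) - 1)  # 6652, matching "pDEST25 6652 bp"
```

A block such as `CCGGAA6CGA`, where a digit has been read in place of a base, is rejected with a `ValueError` instead of being silently joined into the sequence.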
600 ttg acg tca atg gga gtt tgt ttt ggc acc aaa atc aac ggg act ttc caa
    aac tgc agt tac cct caa aca aaa ccg tgg ttt tag ttg ccc tga aag gtt

651 aat gtc gta aca act ccg ccc cat tga cgc aaa tgg gcg gta ggc gtg tac
    tta cag cat tgt tga ggc ggg gta act gcg ttt acc cgc cat ccg cac atg

    CMV promoter                                        mRNA start
702 ggt ggg agg tct ata taa gca gag ctc gtt tag tga acc gtc aga tcg cct
    cca ccc tcc aga tat att cgt ctc gag caa atc act tgg cag tct agc gga

753 gga gac gcc atc cac gct gtt ttg acc tcc ata gaa gac acc ggg acc gat
    cct ctg cgg tag gtg cga caa aac tgg agg tat ctt ctg tgg ccc tgg cta

                                            Start Translation
                                                 M   A   Y   Y   H   H
804 cca gcc tcc gga ctc tag cct agg ccg cgg acc atg gcg tac tac cat cac
    ggt cgg agg cct gag atc gga tcc ggc gcc tgg tac cgc atg atg gta gtg

     H   H   H   H   S   R   S   T   S            K   K
                                attR1
855 cat cac cat cac tct aga tca aca agt tt|g tac aaa aaa gct gaa cga gaa
    gta gtg gta gtg aga tct agt tgt tca aac atg ttt|ttt cga ctt gct ctt
                                          Int
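The one-letter amino acid row printed above the coding strand can be reproduced by translating the codons from the start of translation onward. The sketch below does this in Python with the standard genetic code, assuming lowercase space-separated codons in which `|` marks a cut site and is not a base; `translate` and `LEADER` are illustrative names, not part of the source.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a coding strand; spaces and '|' cut marks are ignored."""
    dna = dna.replace(" ", "").replace("|", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# From "Start Translation" (row 804) through the end of row 855.
LEADER = (
    "atg gcg tac tac cat cac "
    "cat cac cat cac tct aga tca "
    "aca agt tt|g tac aaa aaa gct gaa cga gaa"
)
print(translate(LEADER))  # MAYYHHHHHHSRSTSLYKKAERE
```

The six consecutive H residues correspond to the his6 feature, and the residues that follow are read through the attR1 site.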



Sst I       518
Bbs I       581
Sst II      623
Xba I       662
Not I       796
EcoR I     1117
PflM I     1343
Bgl II     1564
Bst1107 I  1650
Srf I      1985
Ava I      1986
Sma I      1986
Sal I      2245
Nhe I      2475
Bcl I      2751
Mun I      2844
Hpa I      2855
Dra III    3346
Kpn I      3684
SexA I     3801
Sfi I      3980
BseR I     4015
Stu I      4031
Hind III   4050
Afl II     4082
Nar I      4227
Tth111 I   4343
Rsr II     4743
Nsp V      4909
Xmn I      5600
Eam1105 I  6199
Afl III    7092
BspLU11 I  7092
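Site positions such as these can be looked up in a sequence listing by searching for the enzyme's recognition sequence, remembering that on a circular plasmid a site may straddle the point where the listing starts. A minimal Python sketch follows, assuming exact (non-degenerate) recognition sequences such as GCGGCCGC for Not I and GAATTC for EcoR I; `find_sites` and its parameters are hypothetical names introduced for illustration.

```python
def find_sites(seq, site, circular=True, first_coord=1):
    """Return the coordinate of the first base of each occurrence of `site`."""
    seq = seq.replace(" ", "").upper()
    site = site.upper()
    # On a circular molecule a site may run past the end of the listing,
    # so the first len(site) - 1 bases are appended before searching.
    searched = seq + seq[:len(site) - 1] if circular else seq
    hits = []
    i = searched.find(site)
    while i != -1:
        hits.append(first_coord + i)
        i = searched.find(site, i + 1)
    return hits

# Row 601 of the pDEST26 listing, treated as a linear fragment.
ROW_601 = "AGACTACATA ATACTGTAAA ACACAACATA TCCAGTCACT ATGGCGGCCG CATTAGGCAC"
print(find_sites(ROW_601, "GCGGCCGC", circular=False, first_coord=601))  # [644]

# Toy circular sequence in which GAATTC spans the origin.
print(find_sites("TTCAAAGAA", "GAATTC"))  # [7]
```

In the listing the Not I recognition sequence starts at base 644, whereas the map gives Not I 796; the map and the listing appear to use different numbering origins, and the sketch makes no attempt to reconcile them.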
pDEST26 7481 bp



Location (Base Nos.)    Gene Encoded

492..509                his6
619..519                attR1
752..1411               CmR
1531..1615              inactivated ccdA
1753..2058              ccdB
2099..2223              attR2
2409..2771              SV40 polyA
2966..3421              f1 intergenic region
3485..3903              SV40 promoter
3948..4742              neo
4806..4854              polyA
5265..6125              Apr
6274..6913              ori
7344..385               CMV promoter
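In these tables a first number larger than the second can mean two different things: a feature on the reverse strand (as with attR1, 844..720 in pDEST25) or a feature that runs through the end of the circular listing and continues from base 1 (as with the CMV promoter, 7344..385). The notation alone does not say which, so the Python sketch below takes that as an explicit argument; this argument, and the name `feature_length`, are assumptions made for illustration.

```python
def feature_length(location, plasmid_size, reverse_strand=False):
    """Length in bases of a feature written as 'start..end' (1-based, inclusive).

    `reverse_strand` must be supplied by the caller: the tables do not
    distinguish a reverse-strand feature from one that wraps past the origin.
    """
    start, end = (int(x) for x in location.replace(" ", "").split(".."))
    if reverse_strand:
        start, end = end, start
    if start <= end:
        return end - start + 1
    # Feature runs to the last base of the plasmid and continues from base 1.
    return (plasmid_size - start + 1) + end

print(feature_length("752..1411", 7481))                        # 660 (CmR, pDEST26)
print(feature_length("844 . . 720", 6652, reverse_strand=True)) # 125 (attR1, pDEST25)
print(feature_length("7344..385", 7481))                        # 523 (CMV promoter, pDEST26)
print(feature_length("7628. .27", 8123))                        # 523 (CMV promoter, pDEST27)
```

The CMV promoter comes out at the same length in pDEST26 and pDEST27, which is a useful cross-check that the wrap-around reading is the intended one for those two entries.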



1 GTAAACTGCC CACTTGGCAG TACATCAAGT GTATCATATG CCAAGTACGC CCCCTATTGA 
61 CGTCAATGAC GGTAAATGGC CCGCCTGGCA TTATGCCCAG TACATGACCT TATGGGACTT 
121 TCCTACTTGG CAGTACATCT ACGTATTAGT CATCGCTATT ACCATGGTGA TGCGGTTTTG 
181 GCAGTACATC AATGGGCGTG GATAGCGGTT TGACTCACGG GGATTTCCAA GTCTCCACCC 
241 CATTGACGTC AATGGGAGTT TGTTTTGGCA CCAAAATCAA CGGGACTTTC CAAAATGTCG 
301 TAACAACTCC GCCCCATTGA CGCAAATGGG CGGTAGGCGT GTACGGTGGG AGGTCTATAT 
361 AAGCAGAGCT CGTTTAGTGA ACCGTCAGAT CGCCTGGAGA CGCCATCCAC GCTGTTTTGA 
421 CCTCCATAGA AGACACCGGG ACCGATCCAG CCTCCGGACT CTAGCCTAGG CCGCGGACCA 
481 TGGCGTACTA CCATCACCAT CACCATCAGT CTAGATCAAC AAGTTTGTAC AAAAAAGCTG 
541 AACGAGAAAC GTAAAATGAT ATAAATATCA ATATATTAAA TTAGATTTTG CATAAAAAAC
601 AGACTACATA ATACTGTAAA ACACAACATA TCCAGTCACT ATGGCGGCCG CATTAGGCAC 
661 CCCAGGCTTT ACACTTTATG CTTCCGGCTC GTATAATGTG TGGATTTTGA GTTAGGATCC 
721 GGCGAGATTT TCAGGAGCTA AGGAAGCTAA AATGGAGAAA AAAATCACTG GATATACCAC 
781 CGTTGATATA TCCCAATGGC ATCGTAAAGA ACATTTTGAG GCATTTCAGT CAGTTGCTCA 
841 ATGTACCTAT AACCAGACCG TTCAGCTGGA TATTACGGCC TTTTTAAAGA CCGTAAAGAA 
901 AAATAAGCAC AAGTTTTATC CGGCCTTTAT TCACATTCTT GCCCGCCTGA TGAATGCTCA 
961 TCCGGAATTC CGTATGGCAA TGAAAGACGG TGAGCTGGTG ATATGGGATA GTGTTCACCC 
1021 TTGTTACACC GTTTTCCATG AGCAAACTGA AACGTTTTCA TCGCTCTGGA GTGAATACCA 
1081 CGACGATTTC CGGCAGTTTC TACACATATA TTCGCAAGAT GTGGCGTGTT ACGGTGAAAA 
1141 CCTGGCCTAT TTCCCTAAAG GGTTTATTGA GAATATGTTT TTCGTCTCAG CCAATCCCTG 
1201 GGTGAGTTTC ACCAGTTTTG ATTTAAACGT GGCCAATATG GACAACTTCT TCGCCCCCGT 
1261 TTTCACCATG GGCAAATATT ATACGCAAGG CGACAAGGTG CTGATGCCGC TGGCGATTCA 
1321 GGTTCATCAT GCCGTCTGTG ATGGCTTCCA TGTCGGCAGA ATGCTTAATG AATTACAACA 
1381 GTACTGCGAT GAGTGGCAGG GCGGGGCGTA AAGATCTGGA TCCGGCTTAC TAAAAGCCAG
1441 ATAACAGTAT GCGTATTTGC GCGCTGATTT TTGCGGTATA AGAATATATA CTGATATGTA 
1501 TACCCGAAGT ATGTCAAAAA GAGGTGTGCT ATGAAGCAGC GTATTACAGT GACAGTTGAC 
1561 AGCGACAGCT ATCAGTTGCT CAAGGCATAT ATGATGTCAA TATCTCCGGT CTGGTAAGCA 
1621 CAACCATGCA GAATGAAGCC CGTCGTCTGC GTGCCGAACG CTGGAAAGCG GAAAATCAGG
1681 AAGGGATGGC TGAGGTCGCC CGGTTTATTG AAATGAACGG CTCTTTTGCT GACGAGAACA
1741 GGGACTGGTG AAATGCAGTT TAAGGTTTAC ACCTATAAAA GAGAGAGCCG TTATCGTCTG
1801 TTTGTGGATG TACAGAGTGA TATTATTGAC ACGCCCGGGC GACGGATGGT GATCCCCCTG
1861 GCCAGTGCAC GTCTGCTGTC AGATAAAGTC TCCCGTGAAC TTTACCCGGT GGTGCATATC
1921 GGGGATGAAA GCTGGCGCAT GATGACCACC GATATGGCCA GTGTGCCGGT CTCCGTTATC
1981 GGGGAAGAAG TGGCTGATCT CAGCCACCGC GAAAATGACA TCAAAAACGC CATTAACCTG
2041 ATGTTCTGGG GAATATAAAT GTCAGGCTCC CTTATACACA GCCAGTCTGG AGGTCGACCA
2101 TAGTGACTGG ATATGTTGTG TTTTACAGTA TTATGTAGTC TGTTTTTTAT GCAAAATCTA
2161 ATTTAATATA TTGATATTTA TATCATTTTA CGTTTCTCGT TCAGCTTTCT TGTACAAAGT
2221 GGTTGATCGC GTGCATGCGA CGTCATAGCT CTCTCCCTAT AGTGAGTCGT ATTATAAGCT
2281 AGGCACTGGC CGTCGTTTTA CAACGTCGTG ACTGGGAAAA CTGCTAGCTT GGGATCTTTG
2341 TGAAGGAACC TTACTTCTGT GGTGTGACAT AATTGGACAA ACTACCTACA GAGATTTAAA
2401 GCTCTAAGGT AAATATAAAA TTTTTAAGTG TATAATGTGT TAAACTAGCT GCATATGCTT
2461 GCTGCTTGAG AGTTTTGCTT ACTGAGTATG ATTTATGAAA ATATTATACA CAGGAGCTAG
2521 TGATTCTAAT TGTTTGTGTA TTTTAGATTC ACAGTCCCAA GGCTCATTTC AGGCCCCTCA
2581 GTCCTCACAG TCTGTTCATG ATCATAATCA GCCATACCAC ATTTGTAGAG GTTTTACTTG
2641 CTTTAAAAAA CCTCCCACAC CTCCCCCTGA ACCTGAAACA TAAAATGAAT GCAATTGTTG
2701 TTGTTAACTT GTTTATTGCA GCTTATAATG GTTACAAATA AAGCAATAGC ATCACAAATT
2761 TCACAAATAA AGCATTTTTT TCACTGCATT CTAGTTGTGG TTTGTCCAAA CTCATCAATG
2821 TATCTTATCA TGTCTGGATC GATCCTGCAT TAATGAATCG GCCAACGCGC GGGGAGAGGC
2881 GGTTTGCGTA TTGGCTGGCG TAATAGCGAA GAGGCCCGCA CCGATCGCCC TTCCCAACAG
2941 TTGCGCAGCC TGAATGGCGA ATGGGACGCG CCCTGTAGCG GCGCATTAAG CGCGGCGGGT
3001 GTGGTGGTTA CGCGCAGCGT GACCGCTACA CTTGCCAGCG CCCTAGCGCC CGCTCCTTTC
3061 GCTTTCTTCC CTTCCTTTCT CGCCACGTTC GCCGGCTTTC CCCGTCAAGC TCTAAATCGG
3121 GGGCTCCCTT TAGGGTTCCG ATTTAGTGCT TTACGGCACC TCGACCCCAA AAAACTTGAT
3181 TAGGGTGATG GTTCACGTAG TGGGCCATCG CCCTGATAGA CGGTTTTTCG CCCTTTGACG
3241 TTGGAGTCCA CGTTCTTTAA TAGTGGACTC TTGTTCCAAA CTGGAACAAC ACTCAACCCT
3301 ATCTCGGTCT ATTCTTTTGA TTTATAAGGG ATTTTGCCGA TTTCGGCCTA TTGGTTAAAA
3361 AATGAGCTGA TTTAACAAAT ATTTAACGCG AATTTTAACA AAATATTAAC GTTTACAATT
3421 TCGCCTGATG CGGTATTTTC TCCTTACGCA TCTGTGCGGT ATTTCACACC GCATACGCGG
3481 ATCTGCGCAG CACCATGGCC TGAAATAACC TCTGAAAGAG GAACTTGGTT AGGTACCTTC
3541 TGAGGCGGAA AGAACCAGCT GTGGAATGTG TGTCAGTTAG GGTGTGGAAA GTCCCCAGGC
3601 TCCCCAGCAG GCAGAAGTAT GCAAAGCATG CATCTCAATT AGTCAGCAAC CAGGTGTGGA
3661 AAGTCCCCAG GCTCCCCAGC AGGCAGAAGT ATGCAAAGCA TGCATCTCAA TTAGTCAGCA
3721 ACCATAGTCC CGCCCCTAAC TCCGCCCATC CCGCCCCTAA CTCCGCCCAG TTCCGCCCAT
3781 TCTCCGCCCC ATGGCTGACT AATTTTTTTT ATTTATGCAG AGGCCGAGGC CGCCTCGGCC
3841 TCTGAGCTAT TCCAGAAGTA GTGAGGAGGC TTTTTTGGAG GCCTAGGCTT TTGCAAAAAG
3901 CTTGATTCTT CTGACACAAC AGTCTCGAAC TTAAGGCTAG AGCCACCATG ATTGAACAAG
3961 ATGGATTGCA CGCAGGTTCT CCGGCCGCTT GGGTGGAGAG GCTATTCGGC TATGACTGGG
4021 CACAACAGAC AATCGGCTGC TCTGATGCCG CCGTGTTCCG GCTGTCAGCG CAGGGGCGCC
4081 CGGTTCTTTT TGTCAAGACC GACCTGTCCG GTGCCCTGAA TGAACTGCAG GACGAGGCAG
4141 CGCGGCTATC GTGGCTGGCC ACGACGGGCG TTCCTTGCGC AGCTGTGCTC GACGTTGTCA
4201 CTGAAGCGGG AAGGGACTGG CTGCTATTGG GCGAAGTGCC GGGGCAGGAT CTCCTGTCAT
4261 CTCACCTTGC TCCTGCCGAG AAAGTATCCA TCATGGCTGA TGCAATGCGG CGGCTGCATA
4321 CGCTTGATCC GGCTACCTGC CCATTCGACC ACCAAGCGAA ACATCGCATC GAGCGAGCAC
4381 GTACTCGGAT GGAAGCCGGT CTTGTCGATC AGGATGATCT GGACGAAGAG CATCAGGGGC
4441 TCGCGCCAGC CGAACTGTTC GCCAGGCTCA AGGCGCGCAT GCCCGACGGC GAGGATCTCG 
4501 TCGTGACCCA TGGCGATGCC TGCTTGCCGA ATATCATGGT GGAAAATGGC CGCTTTTCTG 
4561 GATTCATCGA CTGTGGCCGG CTGGGTGTGG CGGACCGCTA TCAGGACATA GCGTTGGCTA 
4621 CCCGTGATAT TGCTGAAGAG CTTGGCGGCG AATGGGCTGA CCGCTTCCTC GTGCTTTACG 
4681 GTATCGCCGC TCCCGATTCG CAGCGCATCG CCTTCTATCG CCTTCTTGAC GAGTTCTTCT 
4741 GAGCGGGACT CTGGGGTTCG AAATGACCGA CCAAGCGACG CCCAACCTGC CATCACGATG 
4801 GCCGCAATAA AATATCTTTA TTTTCATTAC ATCTGTGTGT TGGTTTTTTG TGTGAATCGA 
4861 TAGCGATAAG GATCCGCGTA TGGTGCACTC TCAGTACAAT CTGCTCTGAT GCCGCATAGT 
4921 TAAGCCAGCC CCGACACCCG CCAACACCCG CTGACGCGCC CTGACGGGCT TGTCTGCTCC 
4981 CGGCATCCGC TTACAGACAA GCTGTGACCG TCTCCGGGAG CTGCATGTGT CAGAGGTTTT 
5041 CACCGTCATC ACCGAAACGC GCGAGACGAA AGGGCCTCGT GATACGCCTA TTTTTATAGG 
5101 TTAATGTCAT GATAATAATG GTTTCTTAGA CGTCAGGTGG CACTTTTCGG GGAAATGTGC
5161 GCGGAACCCC TATTTGTTTA TTTTTCTAAA TACATTCAAA TATGTATCCG CTCATGAGAC 
5221 AATAACCCTG ATAAATGCTT CAATAATATT GAAAAAGGAA GAGTATGAGT ATTCAACATT 
5281 TCCGTGTCGC CCTTATTCCC TTTTTTGCGG CATTTTGCCT TCCTGTTTTT GCTCACCCAG
5341 AAACGCTGGT GAAAGTAAAA GATGCTGAAG ATCAGTTGGG TGCACGAGTG GGTTACATCG 
5401 AACTGGATCT CAACAGCGGT AAGATCCTTG AGAGTTTTCG CCCCGAAGAA CGTTTTCCAA 
5461 TGATGAGCAC TTTTAAAGTT CTGCTATGTG GCGCGGTATT ATCCCGTATT GACGCCGGGC 
5521 AAGAGCAACT CGGTCGCCGC ATACACTATT CTCAGAATGA CTTGGTTGAG TACTCACCAG 
5581 TCACAGAAAA GCATCTTACG GATGGCATGA CAGTAAGAGA ATTATGCAGT GCTGCCATAA 
5641 CCATGAGTGA TAACACTGCG GCCAACTTAC TTCTGACAAC GATCGGAGGA CCGAAGGAGC 
5701 TAACCGCTTT TTTGCACAAC ATGGGGGATC ATGTAACTCG CCTTGATCGT TGGGAACCGG 
5761 AGCTGAATGA AGCCATACCA AACGACGAGC GTGACACCAC GATGCCTGTA GCAATGGCAA 
5821 CAACGTTGCG CAAACTATTA ACTGGCGAAC TACTTACTCT AGCTTCCCGG CAACAATTAA 
5881 TAGACTGGAT GGAGGCGGAT AAAGTTGCAG GACCACTTCT GCGCTCGGCC CTTCCGGCTG 
5941 GCTGGTTTAT TGCTGATAAA TCTGGAGCCG GTGAGCGTGG GTCTCGCGGT ATCATTGCAG 
6001 CACTGGGGCC AGATGGTAAG CCCTCCCGTA TCGTAGTTAT CTACACGACG GGGAGTCAGG 
6061 CAACTATGGA TGAACGAAAT AGACAGATCG CTGAGATAGG TGCCTCACTG ATTAAGCATT 
6121 GGTAACTGTC AGACCAAGTT TACTCATATA TACTTTAGAT TGATTTAAAA CTTCATTTTT 
6181 AATTTAAAAG GATCTAGGTG AAGATCCTTT TTGATAATCT CATGACCAAA ATCCCTTAAC 
6241 GTGAGTTTTC GTTCCACTGA GCGTCAGACC CCGTAGAAAA GATCAAAGGA TCTTCTTGAG
6301 ATCCTTTTTT TCTGCGCGTA ATCTGCTGCT TGCAAACAAA AAAACCACCG CTACCAGCGG
6361 TGGTTTGTTT GCCGGATCAA GAGCTACCAA CTCTTTTTCC GAAGGTAACT GGCTTCAGCA 
6421 GAGCGCAGAT ACCAAATACT GTCCTTCTAG TGTAGCCGTA GTTAGGCCAC CACTTCAAGA 
6481 ACTCTGTAGC ACCGCCTACA TACCTCGCTC TGCTAATCCT GTTACCAGTG GCTGCTGCCA 
6541 GTGGCGATAA GTCGTGTCTT ACCGGGTTGG ACTCAAGACG ATAGTTACCG GATAAGGCGC 
6601 AGCGGTCGGG CTGAACGGGG GGTTCGTGCA CACAGCCCAG CTTGGAGCGA ACGACCTACA 
6661 CCGAACTGAG ATACCTACAG CGTGAGCATT GAGAAAGCGC CACGCTTCCC GAAGGGAGAA 
6721 AGGCGGACAG GTATCCGGTA AGCGGCAGGG TCGGAACAGG AGAGCGCACG AGGGAGCTTC 
6781 CAGGGGGAAA CGCCTGGTAT CTTTATAGTC CTGTCGGGTT TCGCCACCTC TGACTTGAGC
6841 GTCGATTTTT GTGATGCTCG TCAGGGGGGC GGAGCCTATG GAAAAACGCC AGCAACGCGG
6901 CCTTTTTACG GTTCCTGGCC TTTTGCTGGC CTTTTGCTCA CATGTTCTTT CCTGCGTTAT 
6961 CCCCTGATTC TGTGGATAAC CGTATTACCG CCTTTGAGTG AGCTGATACC GCTCGCCGCA 
7021 GCCGAACGAC CGAGCGCAGC GAGTCAGTGA GCGAGGAAGC GGAAGAGCGC CCAATACGCA 
7081 AACCGCCTCT CCCCGCGCGT TGGCCGATTC ATTAATGCAG AGCTTGCAAT TCGCGCGTTT 
7141 TTCAATATTA TTGAAGCATT TATCAGGGTT ATTGTCTCAT GAGCGGATAC ATATTTGAAT 
7201 GTATTTAGAA AAATAAACAA ATAGGGGTTC CGCGCACATT TCCCCGAAAA GTGCCACCTG 
7261 ACGTCTAAGA AACCATTATT ATCATGACAT TAACCTATAA AAATAGGCGT AGTACGAGGC 
7321 CCTTTCACTC ATTAGATGCA TGTCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA 
7381 CCGCCCAACG ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA 
7441 ATAGGGACTT TCCATTGACG TCAATGGGTG GAGTATTTAC G 
pDEST27 GST AMINO FUSION IN pCMV Sport-neo VECTOR

    CMV promoter                                        mRNA start
600 nac ggt ggg agg tct ata taa gca gag ctc gtt tag tga acc gtc aga tcg
    ntg cca ccc tcc aga tat att cgt ctc gag caa atc act tgg cag tct agc

651 cct gga gac gcc atc cac gct gtt ttg acc tcc ata gaa gac acc ggg acc
    gga cct ctg cgg tag gtg cga caa aac tgg agg tat ctt ctg tgg ccc tgg

                                                     M   A   P   I   L
702 gat cca gcc tcc gga ctc tag cct agg ccg cgg acc atg gcc cct ata cta
    cta ggt cgg agg cct gag atc gga tcc ggc gcc tgg tac cgg gga tat gat
                                            Start Translation GST

753 ggt tat tgg aaa att aag ggc ctt gtg caa ccc act cga ctt ctt ttg gaa
    cca ata acc ttt taa ttc ccg gaa cac gtt ggg tga gct gaa gaa aac ctt
                              - GST Protein -

804 tat ctt gaa gaa aaa tat gaa gag cat ttg tat gag cgc gat gaa ggt gat
    ata gaa ctt ctt ttt ata ctt ctc gta aac ata ctc gcg cta ctt cca cta

                                                      V   P   R   S   R
1365 ttt ggt ggt ggc gac cat cct cca aaa tcg gat ctg gtt ccg cgt tct aga
     aaa cca cca ccg ctg gta gga ggt ttt agc cta gac caa ggc gca aga tct

      S   T   S   L    Y   K   K   A
1416 tca aca agt tt|g tac aaa aaa gct gaa cga gaa acg
     agt tgt tca aac atg ttt|ttt cga ctt gct ctt tgc
                           Int



Sst I       518
Bbs I       581
Sst II      623
EcoN I      638
Bsg I       897
Swa I      1056
Xba I      1304
Not I      1438
EcoR I     1759
PflM I     1985
Bgl II     2206
Bst1107 I  2292
Srf I      2627
Ava I      2628
Sma I      2628
Sal I      2887
Nhe I      3117
Mun I      3486
Hpa I      3497
Dra III    3988
Kpn I      4326
SexA I     4443
Sfi I      4622
BseR I     4657
Stu I      4673
Hind III   4692
Afl II     4724
Nar I      4869
Tth111 I   4985
Rsr II     5385
Eam1105 I  6841



pDEST27 8123 bp (rotated to position 7800) 



Location (Base Nos.)    Gene Encoded

130..793                GST
803..927                attR1
1036..1695              CmR
1815..1899              inactivated ccdA
2037..2342              ccdB
2383..2507              attR2
2693..3055              SV40 polyA
3250..3705              f1 intergenic region
3769..4187              SV40 promoter
4232..5026              neo
5090..5138              polyA
5549..6409              Apr
6558..7197              ori
7628..27                CMV promoter
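Since a plasmid is circular, the base chosen as number 1 is arbitrary, and a listing can be rewritten to start anywhere as long as every feature coordinate is shifted to match. The Python sketch below shows that bookkeeping under the assumption of 1-based inclusive coordinates; `rotate` and `remap` are hypothetical helper names. As an observation, the pDEST27 listing opens with ATAAGCAGAG, the bases found from position 359 onward in the pDEST26 listing, and moving the end of the pDEST26 CMV promoter (385) to an origin at 359 gives 27, the value in the table above. Whether this is the rotation the heading refers to is not stated in the source.

```python
def rotate(seq, new_origin):
    """Rewrite a circular sequence so that 1-based base `new_origin` becomes base 1."""
    if not 1 <= new_origin <= len(seq):
        raise ValueError("new_origin must lie within the sequence")
    k = new_origin - 1
    return seq[k:] + seq[:k]

def remap(position, new_origin, size):
    """Coordinate of `position` after the origin has moved to `new_origin`."""
    return (position - new_origin) % size + 1

print(rotate("ACGTTGCA", 3))   # GTTGCAAC
print(remap(2, 3, 8))          # 8: the base just before the new origin becomes the last base
print(remap(385, 359, 7481))   # 27
```

The two functions are consistent with each other: for any position `p`, base `remap(p, k, len(seq))` of `rotate(seq, k)` is the same base as position `p` of the original.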



1 ATAAGCAGAG CTCGTTTAGT GAACCGTCAG ATCGCCTGGA GACGCCATCC ACGCTGTTTT 
61 GACCTCCATA GAAGACACCG GGACCGATCC AGCCTCCGGA CTCTAGCCTA GGCCGCGGAC 
121 CATGGCCCCT ATACTAGGTT ATTGGAAAAT TAAGGGCCTT GTGCAACCCA CTCGACTTCT 
181 TTTGGAATAT CTTGAAGAAA AATATGAAGA GCATTTGTAT GAGCGCGATG AAGGTGATAA 
241 ATGGCGAAAC AAAAAGTTTG AATTGGGTTT GGAGTTTCCC AATCTTCCTT ATTATATTGA 
301 TGGTGATGTT AAATTAACAC AGTCTATGGC CATCATACGT TATATAGCTG ACAAGCACAA 
361 CATGTTGGGT GGTTGTCCAA AAGAGCGTGC AGAGATTTCA ATGCTTGAAG GAGCGGTTTT 
421 GGATATTAGA TACGGTGTTT CGAGAATTGC ATATAGTAAA GACTTTGAAA CTCTCAAAGT 
481 TGATTTTCTT AGCAAGCTAC CTGAAATGCT GAAAATGTTC GAAGATCGTT TATGTCATAA 
541 AACATATTTA AATGGTGATC ATGTAACCCA TCCTGACTTC ATGTTGTATG ACGCTCTTGA 
601 TGTTGTTTTA TACATGGACC CAATGTGCCT GGATGCGTTC CCAAAATTAG TTTGTTTTAA 
661 AAAACGTATT GAAGCTATCC CACAAATTGA TAAGTACTTG AAATCCAGCA AGTATATAGC 
721 ATGGCCTTTG CAGGGCTGGC AAGCCACGTT TGGTGGTGGC GACCATCCTC CAAAATCGGA 
781 TCTGGTTCCG CGTTCTAGAT CAACAAGTTT GTACAAAAAA GCTGAACGAG AAACGTAAAA 
841 TGATATAAAT ATCAATATAT TAAATTAGAT TTTGCATAAA AAACAGACTA CATAATACTG 
901 TAAAACACAA CATATCCAGT CACTATGGCG GCCGCATTAG GCACCCCAGG CTTTACACTT 
961 TATGCTTCCG GCTCGTATAA TGTGTGGATT TTGAGTTAGG ATCCGGCGAG ATTTTCAGGA 
1021 GCTAAGGAAG CTAAAATGGA GAAAAAAATC ACTGGATATA CCACCGTTGA TATATCCCAA 
1081 TGGCATCGTA AAGAACATTT TGAGGCATTT CAGTCAGTTG CTCAATGTAC CTATAACCAG 
1141 ACCGTTCAGC TGGATATTAC GGCCTTTTTA AAGACCGTAA AGAAAAATAA GCACAAGTTT 
1201 TATCCGGCCT TTATTCACAT TCTTGCCCGC CTGATGAATG CTCATCCGGA ATTCCGTATG 
1261 GCAATGAAAG ACGGTGAGCT GGTGATATGG GATAGTGTTC ACCCTTGTTA CACCGTTTTC
1321 CATGAGCAAA CTGAAACGTT TTCATCGCTC TGGAGTGAAT ACCACGACGA TTTCCGGCAG 
1381 TTTCTACACA TATATTCGCA AGATGTGGCG TGTTACGGTG AAAACCTGGC CTATTTCCCT 
1441 AAAGGGTTTA TTGAGAATAT GTTTTTCGTC TCAGCCAATC CCTGGGTGAG TTTCACCAGT 
1501 TTTGATTTAA ACGTGGCCAA TATGGACAAC TTCTTCGCCC CCGTTTTCAC CATGGGCAAA 
1561 TATTATACGC AAGGCGACAA GGTGCTGATG CCGCTGGCGA TTCAGGTTCA TCATGCCGTC 
1621 TGTGATGGCT TCCATGTCGG CAGAATGCTT AATGAATTAC AACAGTACTG CGATGAGTGG 
1681 CAGGGCGGGG CGTAAAGATC TGGATCCGGC TTACTAAAAG CCAGATAACA GTATGCGTAT 
1741 TTGCGCGCTG ATTTTTGCGG TATAAGAATA TATACTGATA TGTATACCCG AAGTATGTCA 
1801 AAAAGAGGTG TGCTATGAAG CAGCGTATTA CAGTGACAGT TGACAGCGAC AGCTATCAGT 
1861 TGCTCAAGGC ATATATGATG TCAATATCTC CGGTCTGGTA AGCACAACCA TGCAGAATGA 
1921 AGCCCGTCGT CTGCGTGCCG AACGCTGGAA AGCGGAAAAT CAGGAAGGGA TGGCTGAGGT 
1981 CGCCCGGTTT ATTGAAATGA ACGGCTCTTT TGCTGACGAG AACAGGGACT GGTGAAATGC 
2041 AGTTTAAGGT TTACACCTAT AAAAGAGAGA GCCGTTATCG TCTGTTTGTG GATGTACAGA 
2101 GTGATATTAT TGACACGCCC GGGCGACGGA TGGTGATCCC CCTGGCCAGT GCACGTCTGC 
2161 TGTCAGATAA AGTCTCCCGT GAACTTTACC CGGTGGTGCA TATCGGGGAT GAAAGCTGGC 
2221 GCATGATGAC CACCGATATG GCCAGTGTGC CGGTCTCCGT TATCGGGGAA GAAGTGGCTG 
2281 ATCTCAGCCA CCGCGAAAAT GACATCAAAA ACGCCATTAA CCTGATGTTC TGGGGAATAT 
2341 AAATGTCAGG CTCCCTTATA CACAGCCAGT CTGCAGGTCG ACCATAGTGA CTGGATATGT 
2401 TGTGTTTTAC AGTATTATGT AGTCTGTTTT TTATGCAAAA TCTAATTTAA TATATTGATA 
2461 TTTATATCAT TTTACGTTTC TCGTTCAGCT TTCTTGTACA AAGTGGTTGA TCGCGTGCAT 
2521 GCGACGTCAT AGCTCTCTCC CTATAGTGAG TCGTATTATA AGCTAGGCAC TGGCCGTCGT 
2581 TTTACAACGT CGTGACTGGG AAAACTGCTA GCTTGGGATC TTTGTGAAGG AACCTTACTT 
2641 CTGTGGTGTG ACATAATTGG ACAAACTACC TACAGAGATT TAAAGCTCTA AGGTAAATAT 
2701 AAAATTTTTA AGTGTATAAT GTGTTAAACT AGCTGCATAT GCTTGCTGCT TGAGAGTTTT 
2761 GCTTACTGAG TATGATTTAT GAAAATATTA TACACAGGAG CTAGTGATTC TAATTGTTTG 
2821 TGTATTTTAG ATTCACAGTC CCAAGGCTCA TTTCAGGCCC CTCAGTCCTC ACAGTCTGTT 
2881 CATGATCATA ATCAGCCATA CCACATTTGT AGAGGTTTTA CTTGCTTTAA AAAACCTCCC 
2941 ACACCTCCCC CTGAACCTGA AACATAAAAT GAATGCAATT GTTGTTGTTA ACTTGTTTAT 
3001 TGCAGCTTAT AATGGTTACA AATAAAGCAA TAGCATCACA AATTTCACAA ATAAAGCATT 
3061 TTTTTCACTG CATTCTAGTT GTGGTTTGTC CAAACTCATC AATGTATCTT ATCATGTCTG 
3121 GATCGATCCT GCATTAATGA ATCGGCCAAC GCGCGGGGAG AGGCGGTTTG CGTATTGGCT 
3181 GGCGTAATAG CGAAGAGGCC CGCACCGATC GCCCTTCCCA ACAGTTGCGC AGCCTGAATG 
3241 GCGAATGGGA CGCGCCCTGT AGCGGCGCAT TAAGCGCGGC GGGTGTGGTG GTTACGCGCA 
3301 GCGTGACCGC TACACTTGCC AGCGCCCTAG CGCCCGCTCC TTTCGCTTTC TTCCCTTCCT 
3361 TTCTCGCCAC GTTCGCCGGC TTTCCCCGTC AAGCTCTAAA TCGGGGGCTC CCTTTAGGGT 
3421 TCCGATTTAG TGCTTTACGG CACCTCGACC CCAAAAAACT TGATTAGGGT GATGGTTCAC 
3481 GTAGTGGGCC ATCGCCCTGA TAGACGGTTT TTCGCCCTTT GACGTTGGAG TCCACGTTCT 
3541 TTAATAGTGG ACTCTTGTTC CAAACTGGAA CAACACTCAA CCCTATCTCG GTCTATTCTT 
3601 TTGATTTATA AGGGATTTTG CCGATTTCGG CCTATTGGTT AAAAAATGAG CTGATTTAAC 
3661 AAATATTTAA CGCGAATTTT AACAAAATAT TAACGTTTAC AATTTCGCCT GATGCGGTAT
3721 TTTCTCCTTA CGCATCTGTG CGGTATTTCA CACCGCATAC GCGGATCTGC GCAGCACCAT 
3781 GGCCTGAAAT AACCTCTGAA AGAGGAACTT GGTTAGGTAC CTTCTGAGGC GGAAAGAACC 
3841 AGCTGTGGAA TGTGTGTCAG TTAGGGTGTG GAAAGTCCCC AGGCTCCCCA GCAGGCAGAA 
3901 GTATGCAAAG CATGCATCTC AATTAGTCAG CAACCAGGTG TGGAAAGTCC CCAGGCTCCC 
3961 CAGCAGGCAG AAGTATGCAA AGCATGCATC TCAATTAGTC AGCAACCATA GTCCCGCCCC 
4021 TAACTCCGCC CATCCCGCCC CTAACTCCGC CCAGTTCCGC CCATTCTCCG CCCCATGGCT 
4081 GACTAATTTT TTTTATTTAT GCAGAGGCCG AGGCCGCCTC GGCCTCTGAG CTATTCCAGA 
4141 AGTAGTGAGG AGGCTTTTTT GGAGGCCTAG GCTTTTGCAA AAAGCTTGAT TCTTCTGACA
4201 CAACAGTCTC GAACTTAAGG CTAGAGCCAC CATGATTGAA CAAGATGGAT TGCACGCAGG 
4261 TTCTCCGGCC GCTTGGGTGG AGAGGCTATT CGGCTATGAC TGGGCACAAC AGACAATCGG 
4321 CTGCTCTGAT GCCGCCGTGT TCCGGCTGTC AGCGCAGGGG CGCCCGGTTC TTTTTGTCAA 
4381 GACCGACCTG TCCGGTGCCC TGAATGAACT GCAGGACGAG GCAGCGCGGC TATCGTGGCT 
4441 GGCCACGACG GGCGTTCCTT GCGCAGCTGT GCTCGACGTT GTCACTGAAG CGGGAAGGGA 
4501 CTGGCTGCTA TTGGGCGAAG TGCCGGGGCA GGATCTCCTG TCATCTCACC TTGCTCCTGC 
4561 CGAGAAAGTA TCCATCATGG CTGATGCAAT GCGGCGGCTG CATACGCTTG ATCCGGCTAC 
4621 CTGCCCATTC GACCACCAAG CGAAACATCG CATCGAGCGA GCACGTACTC GGATGGAAGC 
4681 CGGTCTTGTC GATCAGGATG ATCTGGACGA AGAGCATCAG GGGCTCGCGC CAGCCGAACT 
4741 GTTCGCCAGG CTCAAGGCGC GCATGCCCGA CGGCGAGGAT CTCGTCGTGA CCCATGGCGA 
4801 TGCCTGCTTG CCGAATATCA TGGTGGAAAA TGGCCGCTTT TCTGGATTCA TCGACTGTGG 
4861 CCGGCTGGGT GTGGCGGACC GCTATCAGGA CATAGCGTTG GCTACCCGTG ATATTGCTGA 
4921 AGAGCTTGGC GGCGAATGGG CTGACCGCTT CCTCGTGCTT TACGGTATCG CCGCTCCCGA 
4981 TTCGCAGCGC ATCGCCTTCT ATCGCCTTCT TGACGAGTTC TTCTGAGCGG GACTCTGGGG 
5041 TTCGAAATGA CCGACCAAGC GACGCCCAAC CTGCCATCAC GATGGCCGCA ATAAAATATC 
5101 TTTATTTTCA TTACATCTGT GTGTTGGTTT TTTGTGTGAA TCGATAGCGA TAAGGATCCG 
5161 CGTATGGTGC ACTCTCAGTA CAATCTGCTC TGATGCCGCA TAGTTAAGCC AGCCCCGACA 
5221 CCCGCCAACA CCCGCTGACG CGCCCTGACG GGCTTGTCTG CTCCCGGCAT CCGCTTACAG 
5281 ACAAGCTGTG ACCGTCTCCG GGAGCTGCAT GTGTCAGAGG TTTTCACCGT CATCACCGAA 
5341 ACGCGCGAGA CGAAAGGGCC TCGTGATACG CCTATTTTTA TAGGTTAATG TCATGATAAT 
5401 AATGGTTTCT TAGACGTCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG 
5461 TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT
5521 GCTTCAATAA TATTGAAAAA GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT
5581 TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT
5641 AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG 
5701 CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA 
5761 AGTTCTGCTA TGTGGCGCGG TATTATCCCG TATTGACGCC GGGCAAGAGC AACTCGGTCG 
5821 CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT 
5881 TACGGATGGC ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC 
5941 TGCGGCCAAC TTACTTCTGA CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA
6001 CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT 
6061 ACCAAACGAC GAGCGTGACA CCACGATGCC TGTAGCAATG GCAACAACGT TGCGCAAACT
6121 ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT GGATGGAGGC 
6181 GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA 
6241 TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG 
6301 TAAGCCCTCC CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG 
6361 AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA 
6421 AGTTTACTCA TATATACTTT AGATTGATTT AAAACTTCAT TTTTAATTTA AAAGGATCTA
6481 GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA
6541 CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG 
6601 CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA 
6661 TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA 
6721 TACTGTCCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC 
6781 TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG 
6841 TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC 
6901 GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT 
6961 ACAGCGTGAG CATTGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC 
7021 GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG 
7081 GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG 
7141 CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT 
7201 GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA 
7261 TAACCGTATT ACCGCCTTTG AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG 
7321 CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCCAATA CGCAAACCGC CTCTCCCCGC 
7381 GCGTTGGCCG ATTCATTAAT GCAGAGCTTG CAATTCGCGC GTTTTTCAAT ATTATTGAAG 
7441 CATTTATCAG GGTTATTGTC TCATGAGCGG ATACATATTT GAATGTATTT AGAAAAATAA 
7501 ACAAATAGGG GTTCCGCGCA CATTTCCCCG AAAAGTGCCA CCTGACGTCT AAGAAACCAT 
7561 TATTATCATG ACATTAACCT ATAAAAATAG GCGTAGTACG AGGCCCTTTC ACTCATTAGA 
7621 TGCATGTCGT TACATAACTT ACGGTAAATG GCCCGCCTGG CTGACCGCCC AACGACCCCC 
7681 GCCCATTGAC GTCAATAATG ACGTATGTTC CCATAGTAAC GCCAATAGGG ACTTTCCATT 
7741 GACGTCAATG GGTGGAGTAT TTACGGTAAA CTGCCCACTT GGCAGTACAT CAAGTGTATC 
7801 ATATGCCAAG TACGCCCCCT ATTGACGTCA ATGACGGTAA ATGGCCCGCC TGGCATTATG 
7861 CCCAGTACAT GACCTTATGG GACTTTCCTA CTTGGCAGTA CATCTACGTA TTAGTCATCG 
7921 CTATTACCAT GGTGATGCGG TTTTGGCAGT ACATCAATGG GCGTGGATAG CGGTTTGACT 
7981 CACGGGGATT TCCAAGTCTC CACCCCATTG ACGTCAATGG GAGTTTGTTT TGGCACCAAA 
8041 ATCAACGGGA CTTTCCAAAA TGTCGTAACA ACTCCGCCCC ATTGACGCAA ATGGGCGGTA 
8101 GGCGTGTACG GTGGGAGGTC TAT 
pEXP501: pCMV-SPORT 6 HOST FOR attB LIBRARIES
pEXP501 (cont'd)

FEATURES OF THE attB CLONING VECTOR, pEXP501.
BASES WITHIN HATCHED AREA ARE REPLACED BY cDNA 
IN SOME LTI cDNA LIBRARIES. 

866                          CMV mRNA
    aga gct cgt tta gtg aac cg|t cag atc gcc tgg aga cgc cat cca
    tct cga gca aat cac ttg gca gtc tag cgg acc tct gcg gta ggt

    cgc tgt ttt gac ctc cat aga aga cac cgg gac cga tcc agc ctc
    gcg aca aaa ctg gag gta tct tct gtg gcc ctg gct agg tcg gag

    SstI                        LTI rev PRIMER
    cgg act cta gcc tag gcc gcg gag cgg ata aca att tca cac agg
    gcc tga gat cgg atc cgg cgc ctc gcc tat tgt taa agt gtg tcc

    ABI rev PRIMER   Stu   SP6 PROMOTER   SP6
    aaa cag cta tga cca tta ggc cta ttt agg tga cac tat aga aca
    ttt gtc gat act ggt aat ccg gat aaa tcc act gtg ata tct tgt

    Int attB1   Age Kpn RsrII   EcoRI   Sma
    agt tt|g tac aaa aaa gca ggc tgg tac cgg tcc gg|a att ccc ggg
    tca aac atg ttt|ttt cgt ccg acc atg gcc agg cct taa|ggg ccc

    EcoRV Sal Spe Not Xba
    ata tcg tcg acg agc tca cta gtc ggc ggc cgc t|ct aga gga tcc
    tat agc agc tgc tcg agt gat cag ccg ccg gcg aga tc|t cct agg

    XhoI ApaI HindIII Mlu attB2 Int
    c|tc gag ggg cc|c a|ag ctt a|cg cgt acc cag ctt t|ct tgt aca aag
    gag ct|c c|cc ggg ttc ga|a tgc gc|a tgg gtc gaa aga aca tg|t ttc

    T7   T7 PROMOTER   ABI fwd
    tgg tcc cta tag tga gtc gta tta taa gct agg cac tgg ccg tcg
    acc agg gat atc act cag cat aat att cga tcc gtg acc ggc agc

    Nhe   LTI fwd                                            1272
    ttt tac aac gtc gtg act ggg aaa act g|ct agc ttg gga tct ttg-
    aaa atg ttg cag cac tga ccc ttt tga cga tc|g aac cct aga aac-
pEXP501 4396 bp

1 CCATTCGCCA TTCAGGCTGC GCAACTGTTG GGAAGGGCGA TCGGTGCGGG CCTCTTCGCT 
61 ATTACGCCAG CCAATACGCA AACCGCCTCT CCCCGCGCGT TGGCCGATTC ATTAATGCAG 
121 GATCGATCCA GACATGATAA GATACATTGA TGAGTTTGGA CAAACCACAA CTAGAATGCA 
181 GTGAAAAAAA TGCTTTATTT GTGAAATTTG TGATGCTATT GCTTTATTTG TAACCATTAT 
241 AAGCTGCAAT AAACAAGTTA ACAACAACAA TTGCATTCAT TTTATGTTTC AGGTTCAGGG 
301 GGAGGTGTGG GAGGTTTTTT AAAGCAAGTA AAACCTCTAC AAATGTGGTA TGGCTGATTA
361 TGATCATGAA CAGACTGTGA GGACTGAGGG GCCTGAAATG AGCCTTGGGA CTGTGAATCT 
421 AAAATACACA AACAATTAGA ATCACTAGCT CCTGTGTATA ATATTTTCAT AAATCATACT 
481 CAGTAAGCAA AACTCTCAAG CAGCAAGCAT ATGCAGCTAG TTTAACACAT TATACACTTA
541 AAAATTTTAT ATTTACCTTA GAGCTTTAAA TCTCTGTAGG TAGTTTGTCC AATTATGTCA 
601 CACCACAGAA GTAAGGTTCC TTCACAAAGA TCCCAAGCTA GCAGTTTTCC CAGTCACGAC 
661 GTTGTAAAAC GACGGCCAGT GCCTAGCTTA TAATACGACT CACTATAGGG ACCACTTTGT 
721 ACAAGAAAGC TGGGTACGCG TAAGCTTGGG CCCCTCGAGG GATCCTCTAG AGCGGCCGCC 
781 GACTAGTGAG CTCGTCGACG ATATCCCGGG AATTCCGGAC CGGTACCAGC CTGCTTTTTT
841 GTACAAACTT GTTCTATAGT GTCACCTAAA TAGGCCTAAT GGTCATAGCT GTTTCCTGTG 
901 TGAAATTGTT ATCCGCTCCG CGGCCTAGGC TAGAGTCCGG AGGCTGGATC GGTCCCGGTG 
961 TCTTCTATGG AGGTCAAAAC AGCGTGGATG GCGTCTCCAG GCGATCTGAC GGTTCACTAA 
1021 ACGAGCTCTG CTTATATAGA CCTCCCACCG TACACGCCTA CCGCCCATTT GCGTCAATGG 
1081 GGCGGAGTTG TTACGACATT TTGGAAAGTC CCGTTGATTT TGGTGCCAAA ACAAACTCCC 
1141 ATTGACGTCA ATGGGGTGGA GACTTGGAAA TCCCCGTGAG TCAAACCGCT ATCCACGCCC 
1201 ATTGATGTAC TGCCAAAACC GCATCACCAT GGTAATAGCG ATGACTAATA CGTAGATGTA 
1261 CTGCCAAGTA GGAAAGTCCC ATAAGGTCAT GTACTGGGCA TAATGCCAGG CGGGCCATTT 
1321 ACCGTCATTG ACGTCAATAG GGGGCGTACT TGGCATATGA TACACTTGAT GTACTGCCAA 
1381 GTGGGCAGTT TACCGTAAAT ACTCCACCCA TTGACGTCAA TGGAAAGTCC CTATTGGCGT 
1441 TACTATGGGA ACATACGTCA TTATTGACGT CAATGGGCGG GGGTCGTTGG GCGGTCAGCC 
1501 AGGCGGGCCA TTTACCGTAA GTTATGTAAC GACATGCATC TAATGAGTGA AAGGGCCTCG 
1561 TACTACGCCT ATTTTTATAG GTTAATGTCA TGATAATAAT GGTTTCTTAG ACGTCAGGTG 
1621 GCACTTTTCG GGGAAATGTG CGCGGAACCC CTATTTGTTT ATTTTTCTAA ATACATTCAA 
1681 ATATGTATCC GCTCATGAGA CAATAACCCT GATAAATGCT TCAATAATAT TGAAAAACGC 
1741 GCGAATTGCA AGCTCTGCAT TAATGAATCG GCCAACGCGC GGGGAGAGGC GGTTTGCGTA 
1801 TTGGGCGCTC TTCCGCTTCC TCGCTCACTG ACTCGCTGCG CTCGGTCGTT CGGCTGCGGC 
1861 GAGCGGTATC AGCTCACTCA AAGGCGGTAA TACGGTTATC CACAGAATCA GGGGATAACG 
1921 CAGGAAAGAA CATGTGAGCA AAAGGCCAGC AAAAGGCCAG GAACCGTAAA AAGGCCGCGT 
1981 TGCTGGCGTT TTTCCATAGG CTCCGCCCCC CTGACGAGCA TCACAAAAAT CGACGCTCAA 
2041 GTCAGAGGTG GCGAAACCCG ACAGGACTAT AAAGATACCA GGCGTTTCCC CCTGGAAGCT 
2101 CCCTCGTGCG CTCTCCTGTT CCGACCCTGC CGCTTACCGG ATACCTGTCC GCCTTTCTCC 
2161 CTTCGGGAAG CGTGGCGCTT TCTCAATGCT CACGCTGTAG GTATCTCAGT TCGGTGTAGG 
2221 TCGTTCGCTC CAAGCTGGGC TGTGTGCACG AACCCCCCGT TCAGCCCGAC CGCTGCGCCT
2281 TATCCGGTAA CTATCGTCTT GAGTCCAACC CGGTAAGACA CGACTTATCG CCACTGGCAG 
2341 CAGCCACTGG TAACAGGATT AGCAGAGCGA GGTATGTAGG CGGTGCTACA GAGTTCTTGA 
2401 AGTGGTGGCC TAACTACGGC TACACTAGAA GGACAGTATT TGGTATCTGC GCTCTGCTGA 
2461 AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA GCTCTTGATC CGGCAAACAA ACCACCGCTG 
2521 GTAGCGGTGG TTTTTTTGTT TGCAAGCAGC AGATTACGCG CAGAAAAAAA GGATCTCAAG
2581 AAGATCCTTT GATCTTTTCT ACGGGGTCTG ACGCTCAGTG GAACGAAAAC TCACGTTAAG 
2641 GGATTTTGGT CATGCCATAA CTTCGTATAG CATACATTAT ACGAAGTTAT GGCATGAGAT 
2701 TATCAAAAAG GATCTTCACC TAGATCCTTT TAAATTAAAA ATGAAGTTTT AAATCAATCT 
2761 AAAGTATATA TGAGTAAACT TGGTCTGACA GTTACCAATG CTTAATCAGT GAGGCACCTA 
2821 TCTCAGCGAT CTGTCTATTT CGTTCATCCA TAGTTGCCTG ACTCCCCGTC GTGTAGATAA 
2881 CTACGATACG GGAGGGCTTA CCATCTGGCC CCAGTGCTGC AATGATACCG CGAGACCCAC 
2941 GCTCACCGGC TCCAGATTTA TCAGCAATAA ACCAGCCAGC CGGAAGGGCC GAGCGCAGAA 
3001 GTGGTCCTGC AACTTTATCC GCCTCCATCC AGTCTATTAA TTGTTGCCGG GAAGCTAGAG 
3061 TAAGTAGTTC GCCAGTTAAT AGTTTGCGCA ACGTTGTTGC CATTGCTACA GGCATCGTGG 
3121 TGTCACGCTC GTCGTTTGGT ATGGCTTCAT TCAGCTCCGG TTCCCAACGA TCAAGGCGAG 
3181 TTACATGATC CCCCATGTTG TGCAAAAAAG CGGTTAGCTC CTTCGGTCCT CCGATCGTTG 
3241 TCAGAAGTAA GTTGGCCGCA GTGTTATCAC TCATGGTTAT GGCAGCACTG CATAATTCTC 
3301 TTACTGTCAT GCCATCCGTA AGATGCTTTT CTGTGACTGG TGAGTACTCA ACCAAGTCAT 
3361 TCTGAGAATA GTGTATGCGG CGACCGAGTT GCTCTTGCCC GGCGTCAATA CGGGATAATA 
3421 CCGCGCCACA TAGCAGAACT TTAAAAGTGC TCATCATTGG AAAACGTTCT TCGGGGCGAA 
3481 AACTCTCAAG GATCTTACCG CTGTTGAGAT CCAGTTCGAT GTAACCCACT CGTGCACCCA 
3541 ACTGATCTTC AGCATCTTTT ACTTTCACCA GCGTTTCTGG GTGAGCAAAA ACAGGAAGGC 
3601 AAAATGCCGC AAAAAAGGGA ATAAGGGCGA CACGGAAATG TTGAATACTC ATACTCTTCC
3661 TTTTTCAATA TTATTGAAGC ATTTATCAGG GTTATTGTCT CATGCCAGGG GTGGGCACAC 
3721 ATATTTGATA CCAGCGATCC CTACACAGCA CATAATTCAA TGCGACTTCC CTCTATCGCA 
3781 CATCTTAGAC CTTTATTCTC CCTCCAGCAC ACATCGAAGC TGCCGAGCAA GCCGTTCTCA 
3841 CCAGTCCAAG ACCTGGCATG AGCGGATACA TATTTGAATG TATTTAGAAA AATAAACAAA 
3901 TAGGGGTTCC GCGCACATTT CCCCGAAAAG TGCCACCTGA AATTGTAAAC GTTAATATTT 
3961 TGTTAAAATT CGCGTTAAAT TTTTGTTAAA TCAGCTCATT TTTTAACCAA TAGGCCGAAA 
4021 TCGGCAAAAT CCCTTATAAA TCAAAAGAAT AGACCGAGAT AGGGTTGAGT GTTGTTCCAG 
4081 TTTGGAACAA GAGTCCACTA TTAAAGAACG TGGACTCCAA CGTCAAAGGG CGAAAAACCG 
4141 TCTATCAGGG CGATGGCCCA CTACGTGAAC CATCACCCTA ATCAAGTTTT TTGGGGTCGA 
4201 GGTGCCGTAA AGCACTAAAT CGGAACCCTA AAGGGAGCCC CCGATTTAGA GCTTGACGGG 
4261 GAAAGCCGGC GAACGTGGCG AGAAAGGAAG GGAAGAAAGC GAAAGGAGCG GGCGCTAGGG 
4321 CGCTGGCAAG TGTAGCGGTC ACGCTGCGCG TAACCACCAC ACCCGCCGCG CTTAATGCGC 
4381 CGCTACAGGG CGCGTC 
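
The numbering of the listing (each row begins with the position of its first base, followed by up to six blocks of ten bases) can be checked mechanically. The following is a minimal Python sketch and not part of the original disclosure; `parse_listing` is a hypothetical helper name, and the two rows passed to it are the final rows of the pEXP501 listing.

```python
import re


def parse_listing(lines):
    """Return (first_position, sequence) from numbered listing rows.

    Each row is expected to look like '4381 CGCTACAGGG CGCGTC': a 1-based
    start position followed by space-separated blocks of bases.
    """
    first = None
    chunks = []
    length = 0
    for line in lines:
        fields = line.split()
        if not fields:
            continue  # skip blank rows
        start = int(fields[0])
        bases = "".join(fields[1:]).upper()
        if not re.fullmatch(r"[ACGTN]+", bases):
            raise ValueError(f"non-base character in row starting at {start}")
        if first is None:
            first = start
        elif start != first + length:
            # The row number must equal the position of the next expected base.
            raise ValueError(f"row numbered {start} is out of sequence")
        chunks.append(bases)
        length += len(bases)
    return first, "".join(chunks)


tail = [
    "4321 CGCTGGCAAG TGTAGCGGTC ACGCTGCGCG TAACCACCAC ACCCGCCGCG CTTAATGCGC",
    "4381 CGCTACAGGG CGCGTC",
]
first, seq = parse_listing(tail)
last_position = first + len(seq) - 1
print(last_position)
```

For these two rows the last position works out to 4396, which agrees with the stated size of pEXP501. A row containing stray spaces or non-base characters raises an error instead of being silently joined.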
pDONR201 (kanR)



Restriction site    Position

Hpa I               1
Apa I               22
EcoO109 I           22
Xmn I               507
BspM I              620
Bsa I               741
BstX I              757
Mam I               860
Srf I               874
Sma I               875
Bst1107 I           1211
BssH II             1250
BamH I              1291
Sca I               1329
Dsa I               1443
Nco I               1443
Sty I               1443
Gsu I               1643
EcoR I              1744
Kpn2 I              1748
BsaA I              2053
Nru I               2645
EcoN I              2898
Sgf I               2986
Pvu I               2987
Vsp I               3188
Eco57 I             3634
Hae II              3917
BssS I              3988
Drd I               4053
Afl III             (position illegible)
BspLU11 I           4161
Nsp I               4161
BsaH I              4298
BsrB I              4346
Bbs I               4416
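
Map positions of this kind can be compared against a sequence listing. The following is a minimal Python sketch, not part of the original disclosure. It assumes that a site's position is the 1-based index of the first base of its recognition sequence and it treats the sequence as linear (no wrap-around across the plasmid origin). `SITES` and `find_sites` are illustrative names, only three well-known palindromic recognition sequences are included, and the test sequence is bases 1-60 of the pDONR201 listing.

```python
# Recognition sequences (5'->3') for a few common enzymes.
SITES = {
    "Hpa I": "GTTAAC",
    "Apa I": "GGGCCC",
    "Sma I": "CCCGGG",
}

# Bases 1-60 of the pDONR201 listing, with the spacing removed.
FIRST_60 = "GTTAACGCTAGCATGGATCTCGGGCCCCAAATAATGATTTTATTTTGACTGATAGTGACC"


def find_sites(sequence, recognition):
    """Return the 1-based start positions of every occurrence of a site."""
    seq = sequence.upper()
    positions = []
    i = seq.find(recognition)
    while i != -1:
        positions.append(i + 1)  # convert 0-based index to 1-based position
        i = seq.find(recognition, i + 1)  # step by one so overlaps are found
    return positions


for enzyme, recognition in SITES.items():
    print(enzyme, find_sites(FIRST_60, recognition))
```

On this 60-base stretch the sketch reports Hpa I at position 1 and Apa I at position 22, and no Sma I site. Because the listed recognition sequences are palindromic, scanning one strand is sufficient for them; a non-palindromic site would also need its reverse complement searched.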
pDONR201 4470 bp (rotated to position 3516)



Location (Base Nos.)    Gene Encoded

260..29                 attP1
656..961                ccdB
1099..1184              ccdA
1303..1962              CmR
2210..2442              attP2
2565..3374              KmR
3495..4134              ori
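
Locations in the table are 1-based, inclusive base numbers. Some entries are written with the larger number first (260..29 for attP1). The minimal Python sketch below, which is not part of the original disclosure, reads such an entry as a feature on the complementary strand and returns its reverse complement; that reading is an assumption of the sketch, not something the table states. The helper names are hypothetical, the example sequence is a toy string and not plasmid data, and features spanning the origin of a circular sequence are not handled.

```python
import re

# Translation table for complementing DNA bases (N stays N).
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def parse_location(text):
    """Parse 'start..end' (tolerating stray spaces) into two integers."""
    match = re.fullmatch(r"\s*(\d+)\s*\.\s*\.\s*(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return int(match.group(1)), int(match.group(2))


def extract_feature(sequence, location):
    """Extract a feature using 1-based, inclusive coordinates.

    If start > end, the span is taken from the forward strand and
    reverse-complemented (assumed meaning of a descending location).
    """
    start, end = parse_location(location)
    if start <= end:
        return sequence[start - 1:end].upper()
    return reverse_complement(sequence[end - 1:start])


toy = "ATGCCGTA"  # toy sequence for illustration only
print(extract_feature(toy, "2..5"))
print(extract_feature(toy, "5..2"))
```

With the toy sequence, `2..5` yields `TGCC` and `5..2` yields its reverse complement, `GGCA`. The parser also accepts spaced forms such as `1303 . . 1962`.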



1 GTTAACGCTA GCATGGATCT CGGGCCCCAA ATAATGATTT TATTTTGACT GATAGTGACC 
61 TGTTCGTTGC AACAAATTGA TGAGCAATGC TTTTTTATAA TGCCAACTTT GTACAAAAAA
121 GCTGAACGAG AAACGTAAAA TGATATAAAT ATCAATATAT TAAATTAGAT TTTGCATAAA 
181 AAACAGACTA CATAATACTG TAAAACACAA CATATCCAGT CACTATGAAT CAACTACTTA 
241 GATGGTATTA GTGACCTGTA GTCGACCGAC AGCCTTCCAA ATGTTCTTCG GGTGATGCTG 
301 CCAACTTAGT CGACCGACAG CCTTCCAAAT GTTCTTCTCA AACGGAATCG TCGTATCCAG 
361 CCTACTCGCT ATTGTCCTCA ATGCCGTATT AAATCATAAA AAGAAATAAG AAAAAGAGGT 
421 GCGAGCCTCT TTTTTGTGTG ACAAAATAAA AACATCTACC TATTCATATA CGCTAGTGTC 
481 ATAGTCCTGA AAATCATCTG CATCAAGAAC AATTTCACAA CTCTTATACT TTTCTCTTAC 
541 AAGTCGTTCG GCTTCATCTG GATTTTCAGC CTCTATACTT ACTAAACGTG ATAAAGTTTC 
601 TGTAATTTCT ACTGTATCGA CCTGCAGACT GGCTGTGTAT AAGGGAGCCT GACATTTATA 
661 TTCCCCAGAA CATCAGGTTA ATGGCGTTTT TGATGTCATT TTCGCGGTGG CTGAGATCAG 
721 CCACTTCTTC CCCGATAACG GAGACCGGCA CACTGGCCAT ATCGGTGGTC ATCATGCGCC 
781 AGCTTTCATC CCCGATATGC ACCACCGGGT AAAGTTCACG GGAGACTTTA TCTGACAGCA 
841 GACGTGCACT GGCCAGGGGG ATCACCATCC GTGGCCCGGG CGTGTCAATA ATATCACTCT 
901 GTACATCCAC AAACAGACGA TAACGGCTCT CTCTTTTATA GGTGTAAACC TTAAACTGCA 
961 TTTCACCAGT CCCTGTTCTC GTCAGCAAAA GAGCCGTTCA TTTCAATAAA CCGGGCGACC 
1021 TCAGCCATCC CTTCCTGATT TTCCGCTTTC CAGCGTTCGG CACGCAGACG ACGGGCTTCA 
1081 TTCTGCATGG TTGTGCTTAC CAGACCGGAG ATATTGACAT CATATATGCC TTGAGCAACT 
1141 GATAGCTGTC GCTGTCAACT GTCACTGTAA TACGCTGCTT CATAGCACAC CTCTTTTTGA 
1201 CATACTTCGG GTATACATAT CAGTATATAT TCTTATACCG CAAAAATCAG CGCGCAAATA
1261 CGCATACTGT TATCTGGCTT TTAGTAAGCC GGATCCACGC GATTACGCCC CGCCCTGCCA 
1321 CTCATCGCAG TACTGTTGTA ATTCATTAAG CATTCTGCCG ACATGGAAGC CATCACAGAC 
1381 GGCATGATGA ACCTGAATCG CCAGCGGCAT CAGCACCTTG TCGCCTTGCG TATAATATTT 
1441 GCCCATGGTG AAAACGGGGG CGAAGAAGTT GTCCATATTG GCCACGTTTA AATCAAAACT 
1501 GGTGAAACTC ACCCAGGGAT TGGCTGAGAC GAAAAACATA TTCTCAATAA ACCCTTTAGG 
1561 GAAATAGGCC AGGTTTTCAC CGTAACACGC CACATCTTGC GAATATATGT GTAGAAACTG 
1621 CCGGAAATCG TCGTGGTATT CACTCCAGAG CGATGAAAAC GTTTCAGTTT GCTCATGGAA 
1681 AACGGTGTAA CAAGGGTGAA CACTATCCCA TATCACCAGC TCACCGTCTT TCATTGCCAT 
1741 ACGGAATTCC GGATGAGCAT TCATCAGGCG GGCAAGAATG TGAATAAAGG CCGGATAAAA 
1801 CTTGTGCTTA TTTTTCTTTA CGGTCTTTAA AAAGGCCGTA ATATCCAGCT GAACGGTCTG 
1861 GTTATAGGTA CATTGAGCAA CTGACTGAAA TGCCTCAAAA TGTTCTTTAC GATGCCATTG 
1921 GGATATATCA ACGGTGGTAT ATCCAGTGAT TTTTTTCTCC ATTTTAGCTT CCTTAGCTCC
1981 TGAAAATCTC GATAACTCAA AAAATACGCC CGGTAGTGAT CTTATTTCAT TATGGTGAAA
2041 GTTGGAACCT CTTACGTGCC GATCAACGTC TCATTTTCGC CAAAAGTTGG CCCAGGGCTT 
2101 CCCGGTATCA ACAGGGACAC CAGGATTTAT TTATTCTGCG AAGTGATCTT CCGTCACAGG 
2161 TATTTATTCG GCGCAAAGTG CGTCGGGTGA TGCTGCCAAC TTAGTCGACT ACAGGTCACT 
2221 AATACCATCT AAGTAGTTGA TTCATAGTGA CTGGATATGT TGTGTTTTAC AGTATTATGT 
2281 AGTCTGTTTT TTATGCAAAA TCTAATTTAA TATATTGATA TTTATATCAT TTTACGTTTC 
2341 TCGTTCAGCT TTCTTGTACA AAGTTGGCAT TATAAGAAAG CATTGCTTAT CAATTTGTTG 
2401 CAACGAACAG GTCACTATCA GTCAAAATAA AATCATTATT TGCCATCCAG CTGCAGCTCT 
2461 GGCCCGTGTC TCAAAATCTC TGATGTTACA TTGCACAAGA TAAAAATATA TCATCATGAA 
2521 CAATAAAACT GTCTGCTTAC ATAAACAGTA ATACAAGGGG TGTTATGAGC CATATTCAAC 
2581 GGGAAACGTC GAGGCCGCGA TTAAATTCCA ACATGGATGC TGATTTATAT GGGTATAAAT 
2641 GGGCTCGCGA TAATGTCGGG CAATCAGGTG CGACAATCTA TCGCTTGTAT GGGAAGCCCG 
2701 ATGCGCCAGA GTTGTTTCTG AAACATGGCA AAGGTAGCGT TGCCAATGAT GTTACAGATG 
2761 AGATGGTCAG ACTAAACTGG CTGACGGAAT TTATGCCTCT TCCGACCATC AAGCATTTTA 
2821 TCCGTACTCC TGATGATGCA TGGTTACTCA CCACTGCGAT CCCCGGAAAA ACAGCATTCC 
2881 AGGTATTAGA AGAATATCCT GATTCAGGTG AAAATATTGT TGATGCGCTG GCAGTGTTCC 
2941 TGCGCCGGTT GCATTCGATT CCTGTTTGTA ATTGTCCTTT TAACAGCGAT CGCGTATTTC 
3001 GTCTCGCTCA GGCGCAATCA CGAATGAATA ACGGTTTGGT TGATGCGAGT GATTTTGATG 
3061 ACGAGCGTAA TGGCTGGCCT GTTGAACAAG TCTGGAAAGA AATGCATAAA CTTTTGCCAT 
3121 TCTCACCGGA TTCAGTCGTC ACTCATGGTG ATTTCTCACT TGATAACCTT ATTTTTGACG 
3181 AGGGGAAATT AATAGGTTGT ATTGATGTTG GACGAGTCGG AATCGCAGAC CGATACCAGG 
3241 ATCTTGCCAT CCTATGGAAC TGCCTCGGTG AGTTTTCTCC TTCATTACAG AAACGGCTTT 
3301 TTCAAAAATA TGGTATTGAT AATCCTGATA TGAATAAATT GCAGTTTCAT TTGATGCTCG 
3361 ATGAGTTTTT CTAATCAGAA TTGGTTAATT GGTTGTAACA CTGGCAGAGC ATTACGCTGA 
3421 CTTGACGGGA CGGCGCAAGC TCATGACCAA AATCCCTTAA CGTGAGTTTT CGTTCCACTG
3481 AGCGTCAGAC CCCGTAGAAA AGATCAAAGG ATCTTCTTGA GATCCTTTTT TTCTGCGCGT 
3541 AATCTGCTGC TTGCAAACAA AAAAACCACC GCTACCAGCG GTGGTTTGTT TGCCGGATCA 
3601 AGAGCTACCA ACTCTTTTTC CGAAGGTAAC TGGCTTCAGC AGAGCGCAGA TACCAAATAC 
3661 TGTCCTTCTA GTGTAGCCGT AGTTAGGCCA CCACTTCAAG AACTCTGTAG CACCGCCTAC 
3721 ATACCTCGCT CTGCTAATCC TGTTACCAGT GGCTGCTGCC AGTGGCGATA AGTCGTGTCT 
3781 TACCGGGTTG GACTCAAGAC GATAGTTACC GGATAAGGCG CAGCGGTCGG GCTGAACGGG 
3841 GGGTTCGTGC ACACAGCCCA GCTTGGAGCG AACGACCTAC ACCGAACTGA GATACCTACA 
3901 GCGTGAGCTA TGAGAAAGCG CCACGCTTCC CGAAGGGAGA AAGGCGGACA GGTATCCGGT 
3961 AAGCGGCAGG GTCGGAACAG GAGAGCGCAC GAGGGAGCTT CCAGGGGGAA ACGCCTGGTA 
4021 TCTTTATAGT CCTGTCGGGT TTCGCCACCT CTGACTTGAG CGTCGATTTT TGTGATGCTC 
4081 GTCAGGGGGG CGGAGCCTAT GGAAAAACGC CAGCAACGCG GCCTTTTTAC GGTTCCTGGC 
4141 CTTTTGCTGG CCTTTTGCTC ACATGTTCTT TCCTGCGTTA TCCCCTGATT CTGTGGATAA 
4201 CCGTATTACC GCTAGCCAGG AAGAGTTTGT AGAAACGCAA AAAGGCCATC CGTCAGGATG 
4261 GCCTTCTGCT TAGTTTGATG CCTGGCAGTT TATGGCGGGC GTCCTGCCCG CCACCCTCCG 
4321 GGCCGTTGCT TCACAACGTT CAAATCCGCT CCCGGCGGAT TTGTCCTACT CAGGAGAGCG
4381 TTCACCGACA AACAACAGAT AAAACGAAAG GCCCAGTCTT CCGACTGAGC CTTTCGTTTT 
4441 ATTTGATGCC TGGCAGTTCC CTACTCTCGC 
pDONR202 4204 bp



Location (Base Nos.)    Gene Encoded

369..127                attP1
486..1059               ori
1228..2107              KmR
2381..2140              attP2
2629..3288              CmR
3408..3492              inactivated ccdA
3630..3935              ccdB



1 CGGCATTGAG GACAATAGCG AGTAGGCTGG ATACGACGAT TCCGTTTGAG AAGAACATTT 
61 GGAAGGCTGT CGGTCGACTA AGTTGGCAGC ATCACCCGAA GAACATTTGG AAGGCTGTCG 
121 GTCGACTACA GGTCACTAAT ACCATCTAAG TAGTTGATTC ATAGTGACTG GATATGTTGT 
181 GTTTTACAGT ATTATGTAGT CTGTTTTTTA TGCAAAATCT AATTTAATAT ATTGATATTT
241 ATATCATTTT ACGTTTCTCG TTCAGCTTTT TTGTACAAAG TTGGCATTAT AAAAAAGCAT 
301 TGCTCATCAA TTTGTTGCAA CGAACAGGTC ACTATCAGTC AAAATAAAAT CATTATTTGG 
361 GGCCCGAGAT CCATGCTAGC GGTAATACGG TTATCCACAG AATCAGGGGA TAACGCAGGA 
421 AAGAACATGT GAGCAAAAGG CCAGCAAAAG GCCAGGAACC GTAAAAAGGC CGCGTTGCTG 
481 GCGTTTTTCC ATAGGCTCCG CCCCCCTGAC GAGCATCACA AAAATCGACG CTCAAGTCAG 
541 AGGTGGCGAA ACCCGACAGG ACTATAAAGA TACCAGGCGT TTCCCCCTGG AAGCTCCCTC 
601 GTGCGCTCTC CTGTTCCGAC CCTGCCGCTT ACCGGATACC TGTCCGCCTT TCTCCCTTCG 
661 GGAAGCGTGG CGCTTTCTCA TAGCTCACGC TGTAGGTATC TCAGTTCGGT GTAGGTCGTT 
721 CGCTCCAAGC TGGGCTGTGT GCACGAACCC CCCGTTCAGC CCGACCGCTG CGCCTTATCC 
781 GGTAACTATC GTCTTGAGTC CAACCCGGTA AGACACGACT TATCGCCACT GGCAGCAGCC 
841 ACTGGTAACA GGATTAGCAG AGCGAGGTAT GTAGGCGGTG CTACAGAGTT CTTGAAGTGG 
901 TGGCCTAACT ACGGCTACAC TAGAAGGACA GTATTTGGTA TCTGCGCTCT GCTGAAGCCA 
961 GTTACCTTCG GAAAAAGAGT TGGTAGCTCT TGATCCGGCA AACAAACCAC CGCTGGTAGC 
1021 GGTGGTTTTT TTGTTTGCAA GCAGCAGATT ACGCGCAGAA AAAAAGGATC TCAAGAAGAT
1081 CCTTTGATCT TTTCTACGGG GTCTGACGCT CAGTGGAACG AAAACTCACG TTAAGGGATT 
1141 TTGGTCATGA GCTTGCGCCG TCCCGTCAAG TCAGCGTAAT GCTCTGCCAG TGTTACAACC 
1201 AATTAACCAA TTCTGATTAG AAAAACTCAT CGAGCATCAA ATGAAACTGC AATTTATTCA 
1261 TATCAGGATT ATCAATACCA TATTTTTGAA AAAGCCGTTT CTGTAATGAA GGAGAAAACT 
1321 CACCGAGGCA GTTCCATAGG ATGGCAAGAT CCTGGTATCG GTCTGCGATT CCGACTCGTC 
1381 CAACATCAAT ACAACCTATT AATTTCCCCT CGTCAAAAAT AAGGTTATCA AGTGAGAAAT 
1441 CACCATGAGT GACGACTGAA TCCGGTGAGA ATGGCAAAAG TTTATGCATT TCTTTCCAGA 
1501 CTTGTTCAAC AGGCCAGCCA TTACGCTCGT CATCAAAATC ACTCGCATCA ACCAAACCGT 
1561 TATTCATTCG TGATTGCGCC TGAGCGAGAC GAAATACGCG ATCGCTGTTA AAAGGACAAT 
1621 TACAAACAGG AATCGAATGC AACCGGCGCA GGAACACTGC CAGCGCATCA ACAATATTTT 
1681 CACCTGAATC AGGATATTCT TCTAATACCT GGAATGCTGT TTTTCCGGGG ATCGCAGTGG 
1741 TGAGTAACCA TGCATCATCA GGAGTACGGA TAAAATGCTT GATGGTCGGA AGAGGCATAA
1801 ATTCCGTCAG CCAGTTTAGT CTGACCATCT CATCTGTAAC ATCATTGGCA ACGCTACCTT 
1861 TGCCATGTTT CAGAAACAAC TCTGGCGCAT CGGGCTTCCC ATACAAGCGA TAGATTGTCG 
1921 CACCTGATTG CCCGACATTA TCGCGAGCCC ATTTATACCC ATATAAATCA GCATCCATGT 
1981 TGGAATTTAA TCGCGGCCTC GACGTTTCCC GTTGAATATG GCTCATAACA CCCCTTGTAT
2041 TACTGTTTAT GTAAGCAGAC AGTTTTATTG TTCATGATGA TATATTTTTA TCTTGTGCAA 
2101 TGTAACATCA GAGATTTTGA GACACGGGCC AGAGCTGCAG CTGGATGGCA AATAATGATT 
2161 TTATTTTGAC TGATAGTGAC CTGTTCGTTG CAACAAATTG ATAAGCAATG CTTTCTTATA 
2221 ATGCCAACTT TGTACAAGAA AGCTGAACGA GAAACGTAAA ATGATATAAA TATCAATATA 
2281 TTAAATTAGA TTTTGCATAA AAAACAGACT ACATAATACT GTAAAACACA ACATATCCAG 
2341 TCACTATGAA TCAACTACTT AGATGGTATT AGTGACCTGT AGTCGACTAA GTTGGCAGCA 
2401 TCACCCGACG CACTTTGCGC CGAATAAATA CCTGTGACGG AAGATCACTT CGCAGAATAA 
2461 ATAAATCCTG GTGTCCCTGT TGATACCGGG AAGCCCTGGG CCAACTTTTG GCGAAAATGA 
2521 GACGTTGATC GGCACGTAAG AGGTTCCAAC TTTCACCATA ATGAAATAAG ATCACTACCG 
2581 GGCGTATTTT TTGAGTTATC GAGATTTTCA GGAGCTAAGG AAGCTAAAAT GGAGAAAAAA 
2641 ATCACTGGAT ATACCACCGT TGATATATCC CAATGGCATC GTAAAGAACA TTTTGAGGCA 
2701 TTTCAGTCAG TTGCTCAATG TACCTATAAC CAGACCGTTC AGCTGGATAT TACGGCCTTT 
2761 TTAAAGACCG TAAAGAAAAA TAAGCACAAG TTTTATCCGG CCTTTATTCA CATTCTTGCC 
2821 CGCCTGATGA ATGCTCATCC GGAATTCCGT ATGGCAATGA AAGACGGTGA GCTGGTGATA 
2881 TGGGATAGTG TTCACCCTTG TTACACCGTT TTCCATGAGC AAACTGAAAC GTTTTCATCG 
2941 CTCTGGAGTG AATACCACGA CGATTTCCGG CAGTTTCTAC ACATATATTC GCAAGATGTG
3001 GCGTGTTACG GTGAAAACCT GGCCTATTTC CCTAAAGGGT TTATTGAGAA TATGTTTTTC 
3061 GTCTCAGCCA ATCCCTGGGT GAGTTTCACC AGTTTTGATT TAAACGTGGC CAATATGGAC 
3121 AACTTCTTCG CCCCCGTTTT CACCATGGGC AAATATTATA CGCAAGGCGA CAAGGTGCTG 
3181 ATGCCGCTGG CGATTCAGGT TCATCATGCC GTCTGTGATG GCTTCCATGT CGGCAGAATG 
3241 CTTAATGAAT TACAACAGTA CTGCGATGAG TGGCAGGGCG GGGCGTAATC GCGTGGATCC
3301 GGCTTACTAA AAGCCAGATA ACAGTATGCG TATTTGCGCG CTGATTTTTG CGGTATAAGA 
3361 ATATATACTG ATATGTATAC CCGAAGTATG TCAAAAAGAG GTGTGCTATG AAGCAGCGTA 
3421 TTACAGTGAC AGTTGACAGC GACAGCTATC AGTTGCTCAA GGCATATATG ATGTCAATAT 
3481 CTCCGGTCTG GTAAGCACAA CCATGCAGAA TGAAGCCCGT CGTCTGCGTG CCGAACGCTG 
3541 GAAAGCGGAA AATCAGGAAG GGATGGCTGA GGTCGCCCGG TTTATTGAAA TGAACGGCTC 
3601 TTTTGCTGAC GAGAACAGGG ACTGGTGAAA TGCAGTTTAA GGTTTACACC TATAAAAGAG 
3661 AGAGCCGTTA TCGTCTGTTT GTGGATGTAC AGAGTGATAT TATTGACACG CCCGGGCGAC 
3721 GGATGGTGAT CCCCCTGGCC AGTGCACGTC TGCTGTCAGA TAAAGTCTCC CGTGAACTTT 
3781 ACCCGGTGGT GCATATCGGG GATGAAAGCT GGCGCATGAT GACCACCGAT ATGGCCAGTG 
3841 TGCCGGTCTC CGTTATCGGG GAAGAAGTGG CTGATCTCAG CCACCGCGAA AATGACATCA 
3901 AAAACGCCAT TAACCTGATG TTCTGGGGAA TATAAATGTC AGGCTCCCTT ATACACAGCC 
3961 AGTCTGCAGG TCGATACAGT AGAAATTACA GAAACTTTAT CACGTTTAGT AAGTATAGAG 
4021 GCTGAAAATC CAGATGAAGC CGAACGACTT GTAAGAGAAA AGTATAAGAG TTGTGAAATT 
4081 GTTCTTGATG CAGATGATTT TCAGGACTAT GACACTAGCG TATATGAATA GGTAGATGTT
4141 TTTATTTTGT CACACAAAAA AGAGGCTCGC ACCTCTTTTT CTTATTTCTT TTTATGATTT 
4201 AATA 
pDONR203 4208 bp



Location (Base Nos.)    Gene Encoded

47..131                 inactivated ccdA
251..910                CmR
1158..1398              attP2
1509..2082              ori
2251..3130              KmR
3464..3174              attP1
3812..4117              ccdB



1 GCGTTCGGCA CGCAGACGAC GGGCTTCATT CTGCATGGTT GTGCTTACCA GACCGGAGAT 
61 ATTGACATCA TATATGCCTT GAGCAACTGA TAGCTGTCGC TGTCAACTGT CACTGTAATA 
121 CGCTGCTTCA TAGCACACCT CTTTTTGACA TACTTCGGGT ATACATATCA GTATATATTC 
181 TTATACCGCA AAAATCAGCG CGCAAATACG CATACTGTTA TCTGGCTTTT AGTAAGCCGG 
241 ATCCACGCGT TTACGCCCCG CCCTGCCACT CATCGCAGTA CTGTTGTAAT TCATTAAGCA 
301 TTCTGCCGAC ATGGAAGCCA TCACAGACGG CATGATGAAC CTGAATCGCC AGCGGCATCA 
361 GCACCTTGTC GCCTTGCGTA TAATATTTGC CCATGGTGAA AACGGGGGCG AAGAAGTTGT 
421 CCATATTGGC CACGTTTAAA TCAAAACTGG TGAAACTCAC CCAGGGATTG GCTGAGACGA 
481 AAAACATATT CTCAATAAAC CCTTTAGGGA AATAGGCCAG GTTTTCACCG TAACACGCCA 
541 CATCTTGCGA ATATATGTGT AGAAACTGCC GGAAATCGTC GTGGTATTCA CTCCAGAGCG 
601 ATGAAAACGT TTCAGTTTGC TCATGGAAAA CGGTGTAACA AGGGTGAACA CTATCCCATA 
661 TCACCAGCTC ACCGTCTTTC ATTGCCATAC GGAATTCCGG ATGAGCATTC ATCAGGCGGG 
721 CAAGAATGTG AATAAAGGCC GGATAAAACT TGTGCTTATT TTTCTTTACG GTCTTTAAAA 
781 AGGCCGTAAT ATCCAGCTGA ACGGTCTGGT TATAGGTACA TTGAGCAACT GACTGAAATG 
841 CCTCAAAATG TTCTTTACGA TGCCATTGGG ATATATCAAC GGTGGTATAT CCAGTGATTT
901 TTTTCTCCAT TTTAGCTTCC TTAGCTCCTG AAAATCTCGA TAACTCAAAA AATACGCCCG 
961 GTAGTGATCT TATTTCATTA TGGTGAAAGT TGGAACCTCT TACGTGCCGA TCAACGTCTC
1021 ATTTTCGCCA AAAGTTGGCC CAGGGCTTCC CGGTATCAAC AGGGACACCA GGATTTATTT 
1081 ATTCTGCGAA GTGATCTTCC GTCACAGGTA TTTATTCGGC GCAAAGTGCG TCGGGTGATG 
1141 CTGCCAACTT AGTCGACTAC AGGTCACTAA TACCATCTAA GTAGTTGATT CATAGTGACT 
1201 GGATATGTTG TGTTTTACAG TATTATGTAG TCTGTTTTTT ATGCAAAATC TAATTTAATA
1261 TATTGATATT TATATCATTT TACGTTTCTC GTTCAGCTTT CTTGTACAAA GTTGGCATTA 
1321 TAAGAAAGCA TTGCTTATCA ATTTGTTGCA ACGAACAGGT CACTATCAGT CAAAATAAAA 
1381 TCATTATTTG CCATCCAGCT AGCGGTAATA CGGTTATCCA CAGAATCAGG GGATAACGCA 
1441 GGAAAGAACA TGTGAGCAAA AGGCCAGCAA AAGGCCAGGA ACCGTAAAAA GGCCGCGTTG
1501 CTGGCGTTTT TCCATAGGCT CCGCCCCCCT GACGAGCATC ACAAAAATCG ACGCTCAAGT 
1561 CAGAGGTGGC GAAACCCGAC AGGACTATAA AGATACCAGG CGTTTCCCCC TGGAAGCTCC 
1621 CTCGTGCGCT CTCCTGTTCC GACCCTGCCG CTTACCGGAT ACCTGTCCGC CTTTCTCCCT 
1681 TCGGGAAGCG TGGCGCTTTC TCATAGCTCA CGCTGTAGGT ATCTCAGTTC GGTGTAGGTC 
1741 GTTCGCTCCA AGCTGGGCTG TGTGCACGAA CCCCCCGTTC AGCCCGACCG CTGCGCCTTA 
1801 TCCGGTAACT ATCGTCTTGA GTCCAACCCG GTAAGACACG ACTTATCGCC ACTGGCAGCA 
1861 GCCACTGGTA ACAGGATTAG CAGAGCGAGG TATGTAGGCG GTGCTACAGA GTTCTTGAAG 
1921 TGGTGGCCTA ACTACGGCTA CACTAGAAGA ACAGTATTTG GTATCTGCGC TCTGCTGAAG 
1981 CCAGTTACCT TCGGAAAAAG AGTTGGTAGC TCTTGATCCG GCAAACAAAC CACCGCTGGT
2041 AGCGGTGGTT TTTTTGTTTG CAAGCAGCAG ATTACGCGCA GAAAAAAAGG ATCTCAAGAA 
2101 GATCCTTTGA TCTTTTCTAC GGGGTCTGAC GCTCAGTGGA ACGAAAACTC ACGTTAAGGG
2161 ATTTTGGTCA TGAGCTTGCG CCGTCCCGTC AAGTCAGCGT AATGCTCTGC CAGTGTTACA 
2221 ACCAATTAAC CAATTCTGAT TAGAAAAACT CATCGAGCAT CAAATGAAAC TGCAATTTAT 
2281 TCATATCAGG ATTATCAATA CCATATTTTT GAAAAAGCCG TTTCTGTAAT GAAGGAGAAA 
2341 ACTCACCGAG GCAGTTCCAT AGGATGGCAA GATCCTGGTA TCGGTCTGCG ATTCCGACTC 
2401 GTCCAACATC AATACAACCT ATTAATTTCC CCTCGTCAAA AATAAGGTTA TCAAGTGAGA 
2461 AATCACCATG AGTGACGACT GAATCCGGTG AGAATGGCAA AAGTTTATGC ATTTCTTTCC 
2521 AGACTTGTTC AACAGGCCAG CCATTACGCT CGTCATCAAA ATCACTCGCA TCAACCAAAC 
2581 CGTTATTCAT TCGTGATTGC GCCTGAGCGA GACGAAATAC GCGATCGCTG TTAAAAGGAC 
2641 AATTACAAAC AGGAATCGAA TGCAACCGGC GCAGGAACAC TGCCAGCGCA TCAACAATAT 
2701 TTTCACCTGA ATCAGGATAT TCTTCTAATA CCTGGAATGC TGTTTTTCCG GGGATCGCAG 
2761 TGGTGAGTAA CCATGCATCA TCAGGAGTAC GGATAAAATG CTTGATGGTC GGAAGAGGCA 
2821 TAAATTCCGT CAGCCAGTTT AGTCTGACCA TCTCATCTGT AACATCATTG GCAACGCTAC 
2881 CTTTGCCATG TTTCAGAAAC AACTCTGGCG CATCGGGCTT CCCATACAAG CGATAGATTG 
2941 TCGCACCTGA TTGCCCGACA TTATCGCGAG CCCATTTATA CCCATATAAA TCAGCATCCA 
3001 TGTTGGAATT TAATCGCGGC CTCGACGTTT CCCGTTGAAT ATGGCTCATA ACACCCCTTG 
3061 TATTACTGTT TATGTAAGCA GACAGTTTTA TTGTTCATGA TGATATATTT TTATCTTGTG 
3121 CAATGTAACA TCAGAGATTT TGAGACACGG GCCAGAGCTG CAGCTAGCAT GGATCTCGGG 
3181 CCCCAAATAA TGATTTTATT TTGACTGATA GTGACCTGTT CGTTGCAACA AATTGATGAG
3241 CAATGCTTTT TTATAATGCC AACTTTGTAC AAAAAAGCTG AACGAGAAAC GTAAAATGAT 
3301 ATAAATATCA ATATATTAAA TTAGATTTTG CATAAAAAAC AGACTACATA ATACTGTAAA 
3361 ACACAACATA TCCAGTCACT ATGAATCAAC TACTTAGATG GTATTAGTGA CCTGTAGTCG 
3421 ACCGACAGCC TTCCAAATGT TCTTCGGGTG ATGCTGCCAA CTTAGTCGAC CGACAGCCTT 
3481 CCAAATGTTC TTCTCAAACG GAATCGTCGT ATCCAGCCTA CTCGCTATTG TCCTCAATGC 
3541 CGTATTAAAT CATAAAAAGA AATAAGAAAA AGAGGTGCGA GCCTCTTTTT TGTGTGACAA 
3601 AATAAAAACA TCTACCTATT CATATACGCT AGTGTCATAG TCCTGAAAAT CATCTGCATC 
3661 AAGAACAATT TCACAACTCT TATACTTTTC TCTTACAAGT CGTTCGGCTT CATCTGGATT 
3721 TTCAGCCTCT ATACTTACTA AACGTGATAA AGTTTCTGTA ATTTCTACTG TATCGACCTG 
3781 CAGACTGGCT GTGTATAAGG GAGCCTGACA TTTATATTCC CCAGAACATC AGGTTAATGG
3841 CGTTTTTGAT GTCATTTTCG CGGTGGCTGA GATCAGCCAC TTCTTCCCCG ATAACGGAGA
3901 CCGGCACACT GGCCATATCG GTGGTCATCA TGCGCCAGCT TTCATCCCCG ATATGCACCA 
3961 CCGGGTAAAG TTCACGGGAG ACTTTATCTG ACAGCAGACG TGCACTGGCC AGGGGGATCA 
4021 CCATCCGTCG CCCGGGCGTG TCAATAATAT CACTCTGTAC ATCCACAAAC AGACGATAAC 
4081 GGCTCTCTCT TTTATAGGTG TAAACCTTAA ACTGCATTTC ACCAGTCCCT GTTCTCGTCA
4141 GCAAAAGAGC CGTTCATTTC AATAAACCGG GCGACCTCAG CCATCCCTTC CTGATTTTCC 
4201 GCTTTCCA 
1 CGGCATTGAG GACAATAGCG AGTAGGCTGG ATACGACGAT TCCGTTTGAG AAGAACATTT
61 GGAAGGCTGT CGGTCGACTA CAGGTCACTA ATACCATCTA AGTAGTTGAA TCATAGTGAC 
121 TGGATATGTT GTGTTTTACA GTATTATGTA GTCTGTTTTT TATGCAAAAT CTAATTTAAT 
181 ATATTGATAT TTATATCATT TTACGTTTCT CGTTCAGCTT TTTTGTACAA AGTTGGCATT 
241 ATAAAAAAGC ATTGCTTATC AATTTGTTGC AACGAACAGG TCACTATCAG TCAAAATAAA 
301 ATCATTATTT GGGGCCCGAG ATCCATGCTA GCTGCAGTGC GCAGGGCCCG TGTCTCAAAA 
361 TCTCTGATGT TACATTGCAC AAGATAAAAA TATATCATCA TGAACAATAA AACTGTCTGC 
421 TTACATAAAC AGTAATACAA GGGGTGTTAT GAGCCATATT CAACGGGAAA CGTCTTGCTG 
481 GAGGCCGCGA TTAAATTCCA ACATGGATGC TGATTTATAT GGGTATAAAT GGGCTCGCGA 
541 TAATGTCGGG CAATCAGGTG CGACAATCTT TCGATTGTAT GGGAAGCCCG ATGCGCCAGA 
601 GTTGTTTCTG AAACATGGCA AAGGTAGCGT TGCCAATGAT GTTACAGATG AGATGGTCAG 
661 ACTAAACTGG CTGACGGAAT TTATGCCTCT TCCGACCATC AAGCATTTTA TCCGTACTCC 
721 TGATGATGCA TGGTTACTCA CCACTGCGAT CCGCGGGAAA ACAGCATTCC AGGTATTAGA 
781 AGAATATCCT GATTCAGGTG AAAATATTGT TGATGCGCTG GCAGTGTTCC TGCGCCGGTT 
841 GCATTCGATT CCTGTTTGTA ATTGTCCTTT TAACAGCGAT CGCGTATTTC GTCTCGCTCA 
901 GGCGCAATCA CGAATGAATA ACGGTTTGGT TGATGCGAGT GATTTTGATG ACGAGCGTAA 
961 TGGCTGGCCT GTTGAACAAG TCTGGAAAGA AATGCATACG CTTTTGCCAT TCTCACCGGA 
1021 TTCAGTCGTC ACTCATGGTG ATTTCTCACT TGATAACCTT ATTTTTGACG AGGGGAAATT 
1081 AATAGGTTGT ATTGATGTTG GACGAGTCGG AATCGCAGAC CGATACCAGG ATCTTGCCAT 
1141 CCTATGGAAC TGCCTCGGTG AGTTTTCTCC TTCATTACAG AAACGGCTTT TTCAAAAATA 
1201 TGGTATTGAT AATCCTGATA TGAATAAATT GCAGTTTCAT TTGATGCTCG ATGAGTTTTT 
1261 CTAATCAGAA TTGGTTAATT GGTTGTAACA CTGGCAGAGC ATTACGCTGA CTTGACGGGA 
1321 CGGCGNCATG ACCAAAATCC CTTAACGTGA GTTTTCGTTC CACTGAGCGT CAGACCCCGT 
1381 AGAAAAGATC AAAGGATCTT CTTGAGATCC TTTTTTTCTG CGCGTAATCT GCTGCTTGCA
1441 AACAAAAAAA CCACCGCTAC CAGCGGTGGT TTGTTTGCCG GATCAAGAGC TACCAACTCT 
1501 TTTTCCGAAG GTAACTGGCT TCAGCAGAGC GCAGATACCA AATACTGTCC TTCTAGTGTA 
1561 GCCGTAGTTA GGCCACCACT TCAAGAACTC TGTAGCACCG CCTACATACC TCGCTCTGCT 
1621 AATCCTGTTA CCAGTGGCTG CTGCCAGTGG CGATAAGTCG TGTCTTACCG GGTTGGACTC 
1681 AAGACGATAG TTACCGGATA AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT CGTGCACACA 
1741 GCCCAGCTTG GAGCGAACGA CCTACACCGA ACTGAGATAC CTACAGCGTG AGCTATGAGA 
1801 AAGCGCCACG CTTCCCGAAG GGAGAAAGGC GGACAGGTAT CCGGTAAGCG GCAGGGTCGG 
1861 AACAGGAGAG CGCACGAGGG AGCTTCCAGG GGGAAACGCC TGGTATCTTT ATAGTCCTGT 
1921 CGGGTTTCGC CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG 
1981 CCTATGGAAA AACGCCAGCA ACGCGGCCTT TTTACGGTTC CTGGCCTTTT GCTGGCCTTT 
2041 TGCTCACATG TTCTTTCCTG CGTTATCCCC TGATTCTGTG GATAACCGTA TTACCGCTAG 
2101 CTGGATCGGC AAATAATGAT TTTATTTTGA CTGATAGTGA CCTGTTCGTT GCAACAAATT 
2161 GATAAGCAAT GCTTTTTTAT AATGCCAACT TTGTACAAGA AAGCTGAACG AGAAACGTAA
2221 AATGATATAA ATATCAATAT ATTAAATTAG ATTTTGCATA AAAAACAGAC TACATAATAC
2281 TGTAAAACAC AACATATCCA GTCACTATGA TTCAACTACT TAGATGGTAT TAGTGACCTG
2341 TAGTCGACTA AGTTGGCAGC ATCACCCGAC GCACTTTGCG CCGAATAAAT ACCTGTGACG
2401 GAAGATCACT TCGCAGAATA AATAAATCCT GGTGTCCCTG TTGATACCGG GAAGCCCTGG
2461 GCCAACTTTT GGCGAAAATG AGACGTTGAT CGGCACATTT CACAACTCTT ATACTTTTCT
2521 CTTACAAGTC GTTCGGCTTC ATCTGGATTT TCAGCCTCTA TACTTACTAA ACGTGATAAA
2581 GTTTCTGTAA TTTCTACTGT ATCGACCTGC AGACTGGCTG TGTATAACGG AGCCTGACAT
2641 TTATATTCCC CAGAACATCA GGTTAATGGC GTTTTTGATG TCATTTTCGC GGTGGCTGAG
2701 ATCAGCCACT TCTTCCCCGA TAACGGAGAC CGGCACACTG GCCATATCGG TGGTCATCAT
2761 GCGCCAGCTT TCATCCCCGA TATGCACCAC CGGGTAAAGT TCACGGGAGA CTTTATCTGA
2821 CAGCAGACGT GCACTGGCCA GGGGGATCAC CATCCGTCGC CCGGGCGTGT CAATAATATC
2881 ACTCTGTACA TCCACAAACA GACGATAACG GCTCTCTCTT TTATAGGTGT AAACCTTAAA
2941 CTGCATTTCA CCAGTCCCTG TTCTCGTCAG CAAAAGAGCC GTTCATTTCA ATAAACCGGG
3001 CGACCTGAGC CATCCCTTCC TGATTTTCCG CTTTCCAGCG TTCGGCACGC AGACGACGGG
3061 CTTCATTCTG CATGGTTGTG CTTACCAGAC CGGAGATATT GACATCATAT ATGCCTTGAG
3121 CAACTGATAG CTGTCGCTGT CAACTGTCAC TGTAATACGC TGCTTCATAG CACACCTCTT
3181 TTTGACATAC TTCGGGTATA CATATCAGTA TATATTCTTA TACCGCAAAA ATCAGCGCGC
3241 AAATACGCAT ACTGTTATCT GGCTTTTAGT AAGCCGGATC CACGCGTTTA CGCCCCGCCC
3301 TGCCACTCAT CGCAGTACTG TTGTAATTCA TTAAGCATTC TGCCGACATG GAAGCCATCA
3361 CAGACGGCAT GATGAACCTG AATCGCCAGC GGCATCAGCA CCTTGTCGCC TTGCGTATAA
3421 TATTTGCCCA TGGTGAAAAC GGGGGCGAAG AAGTTGTCCA TATTGGCCAC GTTTAAATCA
3481 AAACTGGTGA AACTCACCCA GGGATTGGCT GAGACGAAAA ACATATTCTC AATAAACCCT
3541 TTAGGGAAAT AGGCCAGGTT TTCACCGTAA CACGCCACAT CTTGCGAATA TATGTGTAGA
3601 AACTGCCGGA AATCGTCGTG GTATTCACTC CAGAGCGATG AAAACGTTTC AGTTTGCTCA
3661 TGGAAAACGG TGTAACAAGG GTGAACACTA TCCCATATCA CCAGCTCACC GTCTTTCATT
3721 GCCATACGGA ATTCCGGATG AGCATTCATC AGGCGGGCAA GAATGTGAAT AAAGGCCGGA
3781 TAAAACTTGT GCTTATTTTT CTTTACGGTC TTTAAAAAGG CCGTAATATC CAGCTGAACG
3841 GTCTGGTTAT AGGTACATTG AGCAACTGAC TGAAATGCCT CAAAATGTTC TTTACGATGC
3901 CATTGGGATA TATCAACGGT GGTATATCCA GTGATTTTTT TCTCCATTTT AGCTTCCTTA
3961 GCTCCTGAAA ATCTCGATAA CTCAAAAAAT ACGCCCGGTA GTGATCTTAT TTCATTATGG
4021 TGAAAGTTGG AACCTCTTAC TGTTCTTGAT GCAGATGATT TTCAGGACTA TGACACTAGC
4081 ATATATGAAT AGGTAGATGT TTTTATTTTG TCACACAAAA AAGAGGCTCG CACCTCTTTT
4141 TCTTATTTCT TTTTATGATT TAATA
pDONR205 4939 bp

GGCATCAGCACCTTGTCGCCTTGCGTATAATATTTGCCCATGGTGAAAACGGGGGCGAAG 
AAGTTGTCCATATTGGCCACGTTTAAATCAAAACTGGTGAAACTCACCCAGGGATTGGCT 
GAGACGAAAAACATATTCTCAATAAACCCTTTAGGGAAATAGGCCAGGTTTTCACCGTAA
CACGCCACATCTTGCGAATATATGTGTAGAAACTGCCGGAAATCGTCGTGGTATTCACTC 
CAGAGCGATGAAAACGTTTCAGTTTGCTCATGGAAAACGGTGTAACAAGGGTGAACACTA 
TCCCATATCACCAGCTCACCGTCTTTCATTGCCATACGGAATTCCGGATGAGCATTCATC 
AGGCGGGCAAGAATGTGAATAAAGGCCGGATAAAACTTGTGCTTATTTTTCTTTACGGTC
TTTAAAAAGGCCGTAATATCCAGCTGAACGGTCTGGTTATAGGTACATTGAGCAACTGAC 
TGAAATGCCTCAAAATGTTCTTTACGATGCCATTGGGATATATCAACGGTGGTATATCCA
GTGATTTTTTTCTCCATTTTAGCTTCCTTAGCTCCTGAAAATCTCGATAACTCAAAAAAT 
ACGCCCGGTAGTGATCTTATTTCATTATGGTGAAAGTTGGAACCTCTTACGTGCCGATCA
ACGTCTCATTTTCGCCAAAAGTTGGCCCAGGGCTTCCCGGTATCAACAGGGACACCAGGA 
TTTATTTATTCTGCGAAGTGATCTTCCGTCACAGGTATTTATTCGGCGCAAAGTGCGTCG
GGTGATGCTGCCAACTTAGTCGACTACAGGTCACTAATACCATCTAAGTAGTTGATTCAT 
AGTGACTGGATATGTTGTGTTTTACAGTATTATGTAGTCTGTTTTTTATGCAAAATCTAA
TTTAATATATTGATATTTATATCATTTTACGTTTCTCGTTCAGCTTTCTTGTACAAAGTT
GGCATTATAAGAAAGCATTGCTTATCAATTTGTTGCAACGAACAGGTCACTATCAGTCAA 
AATAAAATCATTATTTGCCATCCAGCTGCAGCTCTGGCCCGTGTCTCAAAATCTCTGATG 
TTACATTGCACAAGATAAAAATATATCATCATGAATTCTCATGTTTGACAGCTTATCATC 
GATAAGCTTTAATGCGGTAGTTTATCACAGTTAAATTGCTAACGCAGTCAGGCACCGTGT
ATGAAATCTAACAATGCGCTCATCGTCATCCTCGGCACCGTCACCCTGGATGCTGTAGGC 
ATAGGCTTGGTTATGCCGGTACTGCCGGGCCTCTTGCGGGATATCGTCCATTCCGACAGC 
ATCGCCAGTCACTATGGCGTGCTGCTAGCGCTATATGCGTTGATGCAATTTCTATGCGCA 
CCCGTTCTCGGAGCACTGTCCGACCGCTTTGGCCGCCGCCCAGTCCTGCTCGCTTCGCTA 
CTTGGAGCCACTATCGACTACGCGATCATGGCGACCACACCCGTCCTGTGGATCCTCTAC 
GCCGGACGCATCGTGGCCGGCATCACCGGCGCCACAGGTGCGGTTGCTGGCGCCTATATC 
GCCGACATCACCGATGGGGAAGATCGGGCTCGCCACTTCGGGCTCATGAGCGCTTGTTTC 
GGCGTGGGTATGGTGGCAGGCCCCGTGGCCGGGGGACTGTTGGGCGCCATCTCCTTGCAT 
GCACCATTCCTTGCGGCGGCGGTGCTCAACGGCCTCAACCTACTACTGGGCTGCTTCCTA 
ATGCAGGAGTCGCATAAGGGAGAGCGTCGACCGATGCCCTTGAGAGCCTTCAACCCAGTC 
AGCTCCTTCCGGTGGGCGCGGGGCATGACTATCGTCGCCGCACTTATGACTGTCTTCTTT 
ATCATGCAACTCGTAGGACAGGTGCCGGCAGCGCTCTGGGTCATTTTCGGCGAGGACCGC 
TTTCGCTGGAGCGCGACGATGATCGGCCTGTCGCTTGCGGTATTCGGAATCTTGCACGCC 
CTCGCTCAAGCCTTCGTCACTGGTCCCGCCACCAAACGTTTCGGCGAGAAGCAGGCCATT 
ATCGCCGGCATGGCGGCCGACGCGCTGGGCTACGTCTTGCTGGCGTTCGCGACGCGAGGC 
TGGATGGCCTTCCCCATTATGATTCTTCTCGCTTCCGGCGGCATCGGGATGCCCGCGTTG 
CAGGCCATGCTGTCCAGGCAGGTAGATGACGACCATCAGGGACAGCTTCAAGGATCGCTC 
GCGGCTCTTACCAGCCTAACTTCGATCATTGGACCGCTGATCGTCACGGCGATTTATGCC 
GCCTCGGCGAGCACATGGAACGGGTTGGCATGGATTGTAGGCGCCGCCCTATACCTTGTC 
TGCCTCCCCGCGTTGCGTCGCGGTGCATGGAGCCGGGCCACCTCGACCTGAATGGAAGCC 
GGCGGCACCTCGCTAACGGATTCACCACTCCAAGAATTGGAGCCAATCAATTCTTGCGGA 
GAACTGTGAATGCGCAAACCAACCCTTGGCAGAACATATCCATCGCATGACCAAAATCCC 
TTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTC
TTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACC 
AGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTT
CAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTT 
CAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGC 
TGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAA 
GGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGAC 
CTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGG 
GAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGA 
GCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACT 
TGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAA 
CGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGC 
GTTATCCCCTGATTCTGTGGATAACCGTATTACCGCTAGCCAGGAAGAGTTTGTAGAAAC 
GCAAAAAGGCCATCCGTCAGGATGGCCTTCTGCTTAGTTTGATGCCTGGCAGTTTATGGC 
GGGCGTCCTGCCCGCCACCCTCCGGGCCGTTGCTTCACAACGTTCAAATCCGCTCCCGGC 
GGATTTGTCCTACTCAGGAGAGCGTTCACCGACAAACAACAGATAAAACGAAAGGCCCAG 
TCTTCCGACTGAGCCTTTCGTTTTATTTGATGCCTGGCAGTTCCCTACTCTCGCGTTAAC 
GCTAGCATGGATCTCGGGCCCCAAATAATGATTTTATTTTGACTGATAGTGACCTGTTCG
TTGCAACAAATTGATGAGCAATGCTTTTTTATAATGCCAACTTTGTACAAAAAAGCTGAA
CGAGAAACGTAAAATGATATAAATATCAATATATTAAATTAGATTTTGCATAAAAAACAG
ACTACATAATACTGTAAAACACAACATATCCAGTCACTATGAATCAACTACTTAGATGGT 
ATTAGTGACCTGTAGTCGACCGACAGCCTTCCAAATGTTCTTCGGGTGATGCTGCCAACT 
TAGTCGACCGACAGCCTTCCAAATGTTCTTCTCAAACGGAATCGTCGTATCCAGCCTACT 
CGCTATTGTCCTCAATGCCGTATTAAATCATAAAAAGAAATAAGAAAAAGAGGTGCGAGC 
CTCTTTTTTGTGTGACAAAATAAAAACATCTACCTATTCATATACGCTAGTGTCATAGTC
CTGAAAATCATCTGCATCAAGAACAATTTCACAACTCTTATACTTTTCTCTTACAAGTCG
TTCGGCTTCATCTGGATTTTCAGCCTCTATACTTACTAAACGTGATAAAGTTTCTGTAAT
TTCTACTGTATCGACCTGCAGACTGGCTGTGTATAAGGGAGCCTGACATTTATATTCCCC 
AGAACATCAGGTTAATGGCGTTTTTGATGTCATTTTCGCGGTGGCTGAGATCAGCCACTT
CTTCCCCGATAACGGAGACCGGCACACTGGCCATATCGGTGGTCATCATGCGCCAGCTTT 
CATCCCCGATATGCACCACCGGGTAAAGTTCACGGGAGACTTTATCTGACAGCAGACGTG 
CACTGGCCAGGGGGATCACCATCCGTCGCCCGGGCGTGTCAATAATATCACTCTGTACAT 
CCACAAACAGACGATAACGGCTCTCTCTTTTATAGGTGTAAACCTTAAACTGCATTTCAC
CAGTCCCTGTTCTCGTCAGCAAAAGAGCCGTTCATTTCAATAAACCGGGCGACCTCAGCC 
ATCCCTTCCTGATTTTCCGCTTTCCAGCGTTCGGCACGCAGACGACGGGCTTCATTCTGC 
ATGGTTGTGCTTACCAGACCGGAGATATTGACATCATATATGCCTTGAGCAACTGATAGC 
TGTCGCTGTCAACTGTCACTGTAATACGCTGCTTCATAGCACACCTCTTTTTGACATACT
TCGGGTATACATATCAGTATATATTCTTATACCGCAAAAATCAGCGCGCAAATACGCATA 
CTGTTATCTGGCTTTTAGTAAGCCGGATCCACGCGATTACGCCCCGCCCTGCCACTCATC 
GCAGTACTGTTGTAATTCATTAAGCATTCTGCCGACATGGAAGCCATCACAGACGGCATG 
ATGAACCTGAATCGCCAGC 
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PDONR206 4415 bp 

CGGCATTGAGGACAATAGCGAGTAGGCTGGATACGACGATTCCGTTTGAGAAGAACATTT

GGAAGGCTGTCGGTCGACTACAGGTCACTAATACCATCTAAGTAGTTGAATCATAGTGAC

TGGATATGTTGTGTTTTACAGTATTATGTAGTCTGTTTTTTATGCAAAATCTAATTTAAT

ATATTGATATTTATATCATTTTACGTTTCTCGTTCAGCTTTTTTGTACAAAGTTGGCATT

ATAAAAAAGCATTGCTTATCAATTTGTTGCAACGAACAGGTCACTATCAGTCAAAATAAA

ATCATTATTTGGGGCCCGAGATCCATGCTAGCGGTAATACGGTTATCCACAGAATCAGGG 

GATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAG 

GCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGA 

CGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCT 

GGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCC 

TTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCG 

GTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGC 

TGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCA 

CTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAG 

TTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCT 

CTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACC 

ACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGA 

TCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCA

CGTTAAGGGATTTTGGTCATGNCGCCGTCCCGTCAAGTCAGCGTAATGCTCTGCCAGTGT 

TACAACCAATTAACCAATTCTGATTAGAAAAACTCATCGAGCATCAAATGAAACTGCAAT 

TTATTCATATCAGGATTATCAATACCATATTTTTGAAAAAGCCGTTTCTGTAATGAAGGA

GAAAACTCACCGAGGCAGTTCCATAGGATGGCAAGATCCTGGTATCGGTCTGCGATTCCG 

ACTCGTCCAACATCAATACAACCTATTAGCCGAGGTCTTCCGATCTCCTGAAGCCAGGGC 

AGATCCGTGCACAGCACCTTGCCGTAGAAGAACAGCAAGGCCGCCAATGCCTGACGATGC 

GTGGAGACCGAAACCTTGCGCTCGTTCGCCAGCCAGGACAGAAATGCCTCGACTTCGCTG 

CTGCCCAAGGTTGCCGGGTGACGCACACCGTGGAAACGGATGAAGGCACGAACCCAGTTG 

ACATAAGCCTGTTCGGTTCGTAAACTGTAATGCAAGTAGCGTATGCGCTCACGCAACTGG 

TCCAGAACCTTGACCGAACGCAGCGGTGGTAACGGCGCAGTGGCGGTTTTCATGGCTTGT 

TATGACTGTTTTTTTGTACAGTCTATGCCTCGGGCATCCAAGCAGCAAGCGCGTTACGCC

GTGGGTCGATGTTTGATGTTATGGAGCAGCAACGATGTTACGCAGCAGCAACGATGTTAC 

GCAGCAGGGCAGTCGCCCTAAAACAAAGTTAGGTGGCTCAAGTATGGGCATCATTCGCAC 

ATGTAGGCTCGGCCCTGACCAAGTCAAATCCATGCGGGCTGCTCTTGATCTTTTCGGTCG 

TGAGTTCGGAGACGTAGCCACCTACTCCCAACATCAGCCGGACTCCGATTACCTCGGGAA 

CTTGCTCCGTAGTAAGACATTCATCGCGCTTGCTGCCTTCGACCAAGAAGCGGTTGTTGG 

CGCTCTCGCGGCTTACGTTCTGCCCAGGTTTGAGCAGCCGCGTAGTGAGATCTATATCTA 

TGATCTCGCAGTCTCCGGCGAGCACCGGAGGCAGGGCATTGCCACCGCGCTCATCAATCT 

CCTCAAGCATGAGGCCAACGCGCTTGGTGCTTATGTGATCTACGTGCAAGCAGATTACGG 

TGACGATCCCGCAGTGGCTCTCTATACAAAGTTGGGCATACGGGAAGAAGTGATGCACTT 

TGATATCGACCCAAGTACCGCCACCTAACAATTCGTTCAAGCCGAGATCGGCTTCCCGGC 

CTAATTTCCCCTCGTCAAAAATAAGGTTATCAAGTGAGAAATCACCATGAGTGACGACTG 

AATCCGGTGAGAATGGCAAAAGCGTATGCATTTCTTTCCAGACTTGTTCAACAGGCCAGC 

CATTACGCTCGTCATCAAAATCACTCGCATCAACCAAACCGTTATTCATTCGTGATTGCG 
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CCTGAGCGAGACGAAATACGCGATCGCTGTTAAAAGGACAATTACAAACAGGAATCGAAT

GCAACCGGCGCAGGAACACTGCCAGCGCATCAACAATATTTTCACCTGAATCAGGATATT 

CTTCTAATACCTGGAATGCTGTTTTCCCGCGGATCGCAGTGGTGAGTAACCATGCATCAT 

CAGGAGTACGGATAAAATGCTTGATGGTCGGAAGAGGCATAAATTCCGTCAGCCAGTTTA 

GTCTGACCATCTCATCTGTAACATCATTGGCAACGCTACCTTTGCCATGTTTCAGAAACA

ACTCTGGCGCATCGGGCTTCCCATACAATCGAAAGATTGTCGCACCTGATTGCCCGACAT 

TATCGCGAGCCCATTTATACCCATATAAATCAGCATCCATGTTGGAATTTAATCGCGGCC 

TCCAGCAAGACGTTTCCCGTTGAATATGGCTCATAACACCCCTTGTATTACTGTTTATGT

AAGCAGACAGTTTTATTGTTCATGATGATATATTTTTATCTTGTGCAATGTAACATCAGA

GATTTTGAGACACGGGCCCNGCGCACTGCAGCTGGATCGGCAAATAATGATTTTATTTTG

ACTGATAGTGACCTGTTCGTTGCAACAAATTGATAAGCAATGCTTTTTTATAATGCCAAC

TTTGTACAAGAAAGCTGAACGAGAAACGTAAAATGATATAAATATCAATATATTAAATTA 

GATTTTGCATAAAAAACAGACTACATAATACTGTAAAACACAACATATCCAGTCACTATG 

ATTCAACTACTTAGATGGTATTAGTGACCTGTAGTCGACTAAGTTGGCAGCATCACCCGA 

CGCACTTTGCGCCGAATAAATACCTGTGACGGAAGATCACTTCGCAGAATAAATAAATCC 

TGGTGTCCCTGTTGATACCGGGAAGCCCTGGGCCAACTTTTGGCGAAAATGAGACGTTGA

TCGGCACGTAAGAGGTTCCAACTTTCACCATAATGAAATAAGATCACTACCGGGCGTATT 

TTTTGAGTTATCGAGATTTTCAGGAGCTAAGGAAGCTAAAATGGAGAAAAAAATCACTGG 

ATATACCACCGTTGATATATCCCAATGGCATCGTAAAGAACATTTTGAGGCATTTCAGTC

AGTTGCTCAATGTACCTATAACCAGACCGTTCAGCTGGATATTACGGCCTTTTTAAAGAC

CGTAAAGAAAAATAAGCACAAGTTTTATCCGGCCTTTATTCACATTCTTGCCCGCCTGAT

GAATGCTCATCCGGAATTCCGTATGGCAATGAAAGACGGTGAGCTGGTGATATGGGATAG 

TGTTCACCCTTGTTACACCGTTTTCCATGAGCAAACTGAAACGTTTTCATCGCTCTGGAG

TGAATACCACGACGATTTCCGGCAGTTTCTACACATATATTCGCAAGATGTGGCGTGTTA

CGGTGAAAACCTGGCCTATTTCCCTAAAGGGTTTATTGAGAATATGTTTTTCGTCTCAGC

CAATCCCTGGGTGAGTTTCACCAGTTTTGATTTAAACGTGGCCAATATGGACAACTTCTT

CGCCCCCGTTTTCACCATGGGCAAATATTATACGCAAGGCGACAAGGTGCTGATGCCGCT

GGCGATTCAGGTTCATCATGCCGTCTGTGATGGCTTCCATGTCGGCAGAATGCTTAATGA 

ATTACAACAGTACTGCGATGAGTGGCAGGGCGGGGCGTAAACGCGTGGATCCGGCTTACT

AAAAGCCAGATAACAGTATGCGTATTTGCGCGCTGATTTTTGCGGTATAAGAATATATAC

TGATATGTATACCCGAAGTATGTCAAAAAGAGGTGTGCTATGAAGCAGCGTATTACAGTG 

ACAGTTGACAGCGAGAGCTATCAGTTGCTCAAGGCATATATGATGTCAATATCTCCGGTC 

TGGTAAGCACAACCATGCAGAATGAAGCCCGTCGTCTGCGTGCCGAACGCTGGAAAGCGG 

AAAATCAGGAAGGGATGGCTGAGGTCGCCCGGTTTATTGAAATGAACGGCTCTTTTGCTG

ACGAGAACAGGGACTGGTGAAATGCAGTTTAAGGTTTACACCTATAAAAGAGAGAGCCGT 

TATCGTCTGTTTGTGGATGTACAGAGTGATATTATTGACACGCCCGGGCGACGGATGGTG 

ATCCCCCTGGCCAGTGCACGTCTGCTGTCAGATAAAGTCTCCCGTGAACTTTACCCGGTG 

GTGCATATCGGGGATGAAAGCTGGCGCATGATGACCACCGATATGGCCAGTGTGCCGGTC 

TCCGTTATCGGGGAAGAAGTGGCTGATCTCAGCCACCGCGAAAATGACATCAAAAACGCC 

ATTAACCTGATGTTCTGGGGAATATAAATGTCAGGCTCCGTTATACACAGCCAGTCTGCA 

GGTCGATACAGTAGAAATTACAGAAACTTTATCACGTTTAGTAAGTATAGAGGCTGAAAA

TCCAGATGAAGCCGAACGACTTGTAAGAGAAAAGTATAAGAGTTGTGAAATTGTTCTTGA

TGCAGATGATTTTCAGGACTATGACACTAGCATATATGAATAGGTAGATGTTTTTATTTT

GTCACACAAAAAAGAGGCTCGCACCTCTTTTTCTTATTTCTTTTTATGATTTAATA
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An Entry (pENTR7) Clone of CAT Subcloned into pDEST2 

Start translation (atg, Met)

                                              Met Ser Tyr Tyr His His His
1021  cgg ata aca att tca cac agg aaa cag acc atg tcg tac tac cat cac cat
      gcc tat tgt taa agt gtg tcc ttt gtc tgg tac agc atg atg gta gtg gta

His6 (the last three codons above and the first three below)
attB1

      His His His Gly Ile Thr Ser Leu  Tyr Lys Lys Ala Gly Phe  Glu Asn Leu
1072  cac cat cac ggc atc aca agt tt|g tac aaa aaa gca ggc ttt  gaa aac ctg
      gtg gta gtg ccg tag tgt tca aac  atg ttt|ttt cgt ccg a|aa ctt ttg gac

From pDEST2
Start CAT
TEV protease (cleaves between Gln and Gly)

      Tyr Phe Gln Gly Thr Met Glu Lys Lys Ile Thr Gly Tyr Thr Thr Val Asp
1123  tat ttt caa gga acc atg gag aaa aaa atc act gga tat acc acc gtt gat
      ata aaa gtt cct tgg tac ctc ttt ttt tag tga cct ata tgg tgg caa cta

From pENTR7
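
The amino-acid labels in the figure can be cross-checked against the upper strand. The following is a minimal Python sketch, not part of the original drawing: it translates the open reading frame as transcribed from the figure (OCR ambiguities such as "ate" for "atc" resolved by hand, which is an assumption) using the standard genetic code. The names `CODON_TABLE`, `translate` and `ORF` are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces ignored) into one-letter amino acids."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Upper strand from the atg start codon onward, as transcribed from the figure
# (hand-corrected transcription; treat it as an assumption).
ORF = (
    "atg tcg tac tac cat cac cat "
    "cac cat cac ggc atc aca agt ttg tac aaa aaa gca ggc ttt gaa aac ctg "
    "tat ttt caa gga acc atg gag aaa aaa atc act gga tat acc acc gtt gat"
)

protein = translate(ORF)
print(protein)
```

Under these assumptions the translation contains six consecutive His residues followed by the attB1-encoded residues and the TEV recognition sequence ENLYFQG, with the CAT coding sequence starting two residues later.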



Restriction sites (enzyme, position):

MluI, 13
BclI, 27
BstEII, 194
Bsp120I, 220
ApaI, 220
BspMI, 865
RsrII, 993
BspEI, 1347
BalI, 1616
KpnI, 1918
Asp718, 1918
HindIII, 1924
PvuI, 2864
FspI, 3011
AhdI, 3231
AlwNI, 3710
BspLU11I, 4124
SapI, 4246
DraIII, 4508
NaeI, 4614
NgoMI, 4614
Bst1107I, 4815
NsiI, 5029
Ppu10I, 5029
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Gene ccdB 
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Native Protein Expression: 

ATG Term 




Amp 



Fusion Protein Expression: 
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/ EcoRV,278 
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PRIMERS FOR AMPLIFYING Tet R AND Amp R 
FOR CLONING BY RECOMBINATION 



PRIMERS

GENE-SPECIFIC
attB1 . GENE-SPECIFIC
4G . attB1 . GENE-SPECIFIC

REVERSE PRIMERS

GENE-SPECIFIC
GENE-SPECIFIC . attB2
GENE-SPECIFIC . attB2

Restriction sites (enzyme, position):

ClaI 23
HindIII 29
EcoRV 185
NheI 229
BamHI 375
SphI 562
SalI 651
EagI 939
NruI 972
BspMI 1063
BsmI 1353
StyI 1369
AvaI 1425
BalI 1444
BspMII 1664
PvuII 2066
Tth111I 2219
NdeI 2297
AflIII 2475
PpaI 3435
PstI 3609
PvuI 3735
ScaI 3846
SspI 4170
AatII 4286
EcoRI 4361
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RESULTS OF CLONING 
tet AND amp PCR PRODUCTS 
BY RECOMBINATION 



PCR PRODUCT USED     NO. COLONIES OBTAINED   FORM OF DNA   COLONIES OBTAINED OF
IN GCS REACTIONS     (100 ul PLATED)         ANALYZED      PREDICTED SIZE

tet                  6, 10                   SC            0 OF 8
attB-tet             9, 6                    SC            1 OF 8
attB+4G-tet          824, 1064               SC            7 OF 7
                                             AvaI+Bam      7 OF 7
amp                  7, 13                   SC            0 OF 8
attB-amp             18, 22                  SC            3 OF 8
attB+4G-amp          3020, 3540              SC            8 OF 8
                                             PstI          8 OF 8
attB Plasmid         320, 394
(Pos. Control)
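
The colony counts above lend themselves to a quick numerical summary. The following is a minimal Python sketch, not part of the original figure, that averages the duplicate plate counts and estimates how much the four extra G's on the attB primers raise the colony yield. The dictionary and function names are illustrative assumptions; the numbers are transcribed from the table.

```python
from statistics import mean

# Duplicate colony counts (100 ul plated), transcribed from the table above.
COLONY_COUNTS = {
    "tet": (6, 10),
    "attB-tet": (9, 6),
    "attB+4G-tet": (824, 1064),
    "amp": (7, 13),
    "attB-amp": (18, 22),
    "attB+4G-amp": (3020, 3540),
    "attB plasmid (pos. control)": (320, 394),
}


def mean_colonies(product: str) -> float:
    """Average of the two plate counts for one PCR product."""
    return mean(COLONY_COUNTS[product])


def fold_increase(with_4g: str, without_4g: str) -> float:
    """Ratio of mean colony counts between two PCR products."""
    return mean_colonies(with_4g) / mean_colonies(without_4g)


if __name__ == "__main__":
    for gene in ("tet", "amp"):
        ratio = fold_increase(f"attB+4G-{gene}", f"attB-{gene}")
        print(f"{gene}: attB+4G gives about {ratio:.0f}-fold more colonies than attB alone")
```

The ratio is only a rough indicator, since each condition has just two plates and the fraction of colonies of the predicted size also differs between conditions.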
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EFFECT OF G's ON 5' END OF PCR attB SITES
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TITRATION OF attP AMOUNTS WITH VARIOUS attB PCR AMOUNTS IN
BxP REACTION
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TITRATION OF 256 bp PCR IN BxP 
REACTION 
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FIG.69A 
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TITRATION OF 256 bp PCR IN BxP 
REACTION 
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EFFICIENCY OF CLONING OF 256 bp PCR 
INTO A DESTINATION VECTOR 
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TITRATION OF 1 kb PCR IN BxP 
REACTION 
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TITRATION OF 1 kb PCR IN BxP 
REACTION 
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EFFICIENCY OF CLONING OF 1 kb PCR 
INTO A DESTINATION VECTOR 




[Plot; x-axis: attB PCR CONCENTRATION (fmole), 20 to 160]

FIG.70C

TITRATION OF 1.4 kb PCR IN BxP 
REACTION 
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TITRATION OF 1.4 kb PCR IN BxP 
REACTION 
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EFFICIENCY OF CLONING OF 1.4 kb PCR 
INTO A DESTINATION VECTOR 
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TITRATION OF 3.4 kb PCR IN BxP 
REACTION 
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TITRATION OF 3.4 kb PCR IN BxP 
REACTION 
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EFFICIENCY OF CLONING OF 3.4 kb PCR 
INTO A DESTINATION VECTOR 
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TITRATION OF 4.6 kb PCR IN BxP 
REACTION 
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TITRATION OF 4.6 kb PCR IN BxP 
REACTION 
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EFFICIENCY OF CLONING OF 4.6 kb PCR 
INTO A DESTINATION VECTOR 
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TITRATION OF 10.1 kb PCR IN BxP 
REACTION 
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Fusion protein 

codon

Reading frame A cassette

--- nnn atc aca agt ttg tac aaa aaa gct ---
--- nnn tag tgt tca aac atg ttt ttt cga ---

Reading frame B cassette

--- nnn nnn nna tca aca agt ttg tac aaa aaa gct ---
--- nnn nnn nnt agt tgt tca aac atg ttt ttt cga ---

* cannot be TG or TA

Reading frame C cassette

--- nnn nnn nat caa aca agt ttg tac aaa aaa gct ---
--- nnn nnn nta gtt tgt tca aac atg ttt ttt cga ---

attR1
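
In each cassette the lower strand is the base-by-base complement of the upper strand, and the three cassettes differ only in the number of bases placed between the upstream codons and the start of attR1. The following is a minimal Python sketch, not part of the original figure, that checks both points. The strand strings are transcribed from the drawing with OCR ambiguities resolved by hand, which is an assumption, and the names `CASSETTES`, `complement` and `spacer_length` are illustrative.

```python
# Base-by-base complement (no reversal), so the lower strand can be compared
# position for position with the upper strand as drawn.
COMPLEMENT = str.maketrans("acgtn", "tgcan")


def complement(strand: str) -> str:
    """Complement each base; spaces between codons are left untouched."""
    return strand.translate(COMPLEMENT)


# (upper strand, lower strand) per cassette; hand-corrected transcription.
CASSETTES = {
    "A": ("nnn atc aca agt ttg tac aaa aaa gct",
          "nnn tag tgt tca aac atg ttt ttt cga"),
    "B": ("nnn nnn nna tca aca agt ttg tac aaa aaa gct",
          "nnn nnn nnt agt tgt tca aac atg ttt ttt cga"),
    "C": ("nnn nnn nat caa aca agt ttg tac aaa aaa gct",
          "nnn nnn nta gtt tgt tca aac atg ttt ttt cga"),
}


def spacer_length(upper: str) -> int:
    """Count defined bases between the last n and the start of 'acaagtttg'."""
    bases = upper.replace(" ", "")
    return bases.index("acaagtttg") - (bases.rindex("n") + 1)


if __name__ == "__main__":
    for name, (upper, lower) in CASSETTES.items():
        ok = complement(upper) == lower
        print(f"frame {name}: strands complementary = {ok}, "
              f"spacer = {spacer_length(upper)} bases")
```

A spacer of 3, 4 or 5 bases shifts the attR1 sequence into each of the three possible reading frames relative to the upstream codons.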
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Esp3l 
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prfC Parent III 



4554 bp

Location (Base Nos.)    Gene Encoded
410..286                attR1
660..1319               CmR
1439..1523              inactivated ccdA
1661..1966              ccdB
2007..2131              attR2
2753..3613              amp

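
The feature coordinates above can be turned into feature lengths with simple arithmetic. The following is a minimal Python sketch, not part of the original listing. It assumes the coordinates are 1-based and inclusive, and that a start greater than the end (as for attR1) marks a feature on the opposite strand; both are assumptions, and the names `FEATURES`, `feature_length` and `orientation` are illustrative.

```python
# (start, end) base numbers transcribed from the feature table.
FEATURES = {
    "attR1": (410, 286),
    "CmR": (660, 1319),
    "inactivated ccdA": (1439, 1523),
    "ccdB": (1661, 1966),
    "attR2": (2007, 2131),
    "amp": (2753, 3613),
}


def feature_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


def orientation(start: int, end: int) -> str:
    """Assumed convention: start > end means the complementary strand."""
    return "complement" if start > end else "forward"


if __name__ == "__main__":
    for name, (start, end) in FEATURES.items():
        print(f"{name}: {feature_length(start, end)} bp, {orientation(start, end)}")
```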



1 GCGCCCAATA CGCAAACCGC CTCTCCCCGC GCGTTGGCCG ATTCATTAAT GCAGCTGGCA 
61 CGACAGGTTT CCCGACTGGA AAGCGGGCAG TGAGCGCAAC GCAATTAATG TGAGTTAGCT 
121 CACTCATTAG GCACCCCAGG CTTTACACTT TATGCTTCCG GCTCGTATGT TGTGTGGAAT 
181 TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATGAC CATGATTACG CCAAGCTTGC 
241 ATGCCTGCAG GTCGACTCTA GAGGATCCCC GGGTACCGAT ATCAAACAAG TTTGTACAAA
301 AAAGCTGAAC GAGAAACGTA AAATGATATA AATATCAATA TATTAAATTA GATTTTGCAT 
361 AAAAAACAGA CTACATAATA CTGTAAAACA CAACATATCC AGTCACTATG GCGGCCGCTA 
421 AGTTGGCAGC ATCACCCGAC GCACTTTGCG CCGAATAAAT ACCTGTGACG GAAGATCACT
481 TCGCAGAATA AATAAATCCT GGTGTCCCTG TTGATACCGG GAAGCCCTGG GCCAACTTTT 
541 GGCGAAAATG AGACGTTGAT CGGCACGTAA GAGGTTCCAA CTTTCACCAT AATGAAATAA
601 GATCACTACC GGGCGTATTT TTTGAGTTAT CGAGATTTTC AGGAGCTAAG GAAGCTAAAA 
661 TGGAGAAAAA AATCACTGGA TATACCACCG TTGATATATC CCAATGGCAT CGTAAAGAAC 
721 ATTTTGAGGC ATTTCAGTCA GTTGCTCAAT GTACCTATAA CCAGACCGTT CAGCTGGATA 
781 TTACGGCCTT TTTAAAGACC GTAAAGAAAA ATAAGCACAA GTTTTATCCG GCCTTTATTC 
841 ACATTCTTGC CCGCCTGATG AATGCTCATC CGGAATTCCG TATGGCAATG AAAGACGGTG 
901 AGCTGGTGAT ATGGGATAGT GTTCACCCTT GTTACACCGT TTTCCATGAG CAAACTGAAA 
961 CGTTTTCATC GCTCTGGAGT GAATACCACG ACGATTTCCG GCAGTTTCTA CACATATATT 
1021 CGCAAGATGT GGCGTGTTAC GGTGAAAACC TGGCCTATTT CCCTAAAGGG TTTATTGAGA 
1081 ATATGTTTTT CGTCTCAGCC AATCCCTGGG TGAGTTTCAC CAGTTTTGAT TTAAACGTGG 
1141 CCAATATGGA CAACTTCTTC GCCCCCGTTT TCACCATGGG CAAATATTAT ACGCAAGGCG 
1201 ACAAGGTGCT GATGCCGCTG GCGATTCAGG TTCATCATGC CGTCTGTGAT GGCTTCCATG 
1261 TCGGCAGAAT GCTTAATGAA TTACAACAGT ACTGCGATGA GTGGCAGGGC GGGGCGTAAT
1321 CTAGAGGATC CGGCTTACTA AAAGCCAGAT AACAGTATGC GTATTTGCGC GCTGATTTTT 
1381 GCGGTATAAG AATATATACT GATATGTATA CCCGAAGTAT GTCAAAAAGA GGTGTGCTAT 
1441 GAAGCAGCGT ATTACAGTGA CAGTTGACAG CGACAGCTAT CAGTTGCTCA AGGCATATAT 
1501 GATGTCAATA TCTCCGGTCT GGTAAGCACA ACCATGCAGA ATGAAGCCCG TCGTCTGCGT 
1561 GCCGAACGCT GGAAAGCGGA AAATCAGGAA GGGATGGCTG AGGTCGCCCG GTTTATTGAA 
1621 ATGAACGGCT CTTTTGCTGA CGAGAACAGG GACTGGTGAA ATGCAGTTTA AGGTTTACAC 
1681 CTATAAAAGA GAGAGCCGTT ATCGTCTGTT TGTGGATGTA CAGAGTGATA TTATTGACAC 
1741 GCCCGGGCGA CGGATGGTGA TCCCCCTGGC CAGTGCACGT CTGCTGTCAG ATAAAGTCTC 
1801 CCGTGAACTT TACCCGGTGG TGCATATCGG GGATGAAAGC TGGCGCATGA TGACCACCGA 
1861 TATGGCCAGT GTGCCGGTCT CCGTTATCGG GGAAGAAGTG GCTGATCTCA GCCACCGCGA 
1921 AAATGACATC AAAAACGCCA TTAACCTGAT GTTCTGGGGA ATATAAATGT CAGGCTCCGT 
1981 TATACACAGC CAGTCTGCAG GTCGACCATA GTGACTGGAT ATGTTGTGTT TTACAGTATT 
2041 ATGTAGTCTG TTTTTTATGC AAAATCTAAT TTAATATATT GATATTTATA TCATTTTACG
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2101 TTTCTCGTTC AGCTTTCTTG TACAAAGTGG TTCGATATCG GTACCGAGCT CGAATTCACT 
2161 GGCCGTCGTT TTACAACGTC GTGACTGGGA AAACCCTGGC GTTACCCAAC TTAATCGCCT 
2221 TGCAGCACAT CCCCCTTTCG CCAGCTGGCG TAATAGCGAA GAGGCCCGCA CCGATCGCCC 
2281 TTCCCAACAG TTGCGCAGCC TGAATGGCGA ATGGCGCCTG ATGCGGTATT TTCTCCTTAC 
2341 GCATCTGTGC GGTATTTCAC ACCGCATATG GTGCACTCTC AGTACAATCT GCTCTGATGC 
2401 CGCATAGTTA AGCCAGCCCC GACACCCGCC AACACCCGCT GACGCGCCCT GACGGGCTTG 
2461 TCTGCTCCCG GCATCCGCTT ACAGACAAGC TGTGACCGTC TCCGGGAGCT GCATGTGTCA
2521 GAGGTTTTCA CCGTCATCAC CGAAACGCGC GAGACGAAAG GGCCTCGTGA TACGCCTATT 
2581 TTTATAGGTT AATGTCATGA TAATAATGGT TTCTTAGACG TCAGGTGGCA CTTTTCGGGG 
2641 AAATGTGCGC GGAACCCCTA TTTGTTTATT TTTCTAAATA CATTCAAATA TGTATCCGCT 
2701 CATGAGACAA TAACCCTGAT AAATGCTTCA ATAATATTGA AAAAGGAAGA GTATGAGTAT 
2761 TCAACATTTC CGTGTCGCCC TTATTCCCTT TTTTGCGGCA TTTTGCCTTC CTGTTTTTGC 
2821 TCACCCAGAA ACGCTGGTGA AAGTAAAAGA TGCTGAAGAT CAGTTGGGTG CACGAGTGGG 
2881 TTACATCGAA CTGGATCTCA ACAGCGGTAA GATCCTTGAG AGTTTTCGCC CCGAAGAACG 
2941 TTTTCCAATG ATGAGCACTT TTAAAGTTCT GCTATGTGGC GCGGTATTAT CCCGTATTGA 
3001 CGCCGGGCAA GAGCAACTCG GTCGCCGCAT ACACTATTCT CAGAATGACT TGGTTGAGTA 
3061 CTCACCAGTC ACAGAAAAGC ATCTTACGGA TGGCATGACA GTAAGAGAAT TATGCAGTGC 
3121 TGCCATAACC ATGAGTGATA ACACTGCGGC CAACTTACTT CTGACAACGA TCGGAGGACC
3181 GAAGGAGCTA ACCGCTTTTT TGCACAACAT GGGGGATCAT GTAACTCGCC TTGATCGTTG 
3241 GGAACCGGAG CTGAATGAAG CCATACCAAA CGACGAGCGT GACACCAGGA TGCCTGTAGC 
3301 AATGGCAACA ACGTTGCGCA AACTATTAAC TGGCGAACTA CTTACTCTAG CTTCCCGGCA 
3361 ACAATTAATA GACTGGATGG AGGCGGATAA AGTTGCAGGA CCACTTCTGC GCTCGGCCCT 
3421 TCCGGCTGGC TGGTTTATTG CTGATAAATC TGGAGCCGGT GAGCGTGGGT CTCGCGGTAT 
3481 CATTGCAGCA CTGGGGCCAG ATGGTAAGCC CTCCCGTATC GTAGTTATCT ACACGACGGG 
3541 GAGTCAGGCA ACTATGGATG AACGAAATAG ACAGATCGCT GAGATAGGTG CCTCACTGAT 
3601 TAAGCATTGG TAACTGTCAG ACCAAGTTTA CTCATATATA CTTTAGATTG ATTTAAAACT
3661 TCATTTTTAA TTTAAAAGGA TCTAGGTGAA GATCCTTTTT GATAATCTCA TGACCAAAAT 
3721 CCCTTAACGT GAGTTTTCGT TCCACTGAGC GTCAGACCCC GTAGAAAAGA TCAAAGGATC
3781 TTCTTGAGAT CCTTTTTTTC TGCGCGTAAT CTGCTGCTTG CAAACAAAAA AACCACCGCT
3841 ACCAGCGGTG GTTTGTTTGC CGGATCAAGA GCTACCAACT CTTTTTCCGA AGGTAACTGG 
3901 CTTCAGCAGA GCGCAGATAC CAAATACTGT CCTTCTAGTG TAGCCGTAGT TAGGCCACCA 
3961 CTTCAAGAAC TCTGTAGCAC CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC 
4021 TGCTGCCAGT GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT AGTTACCGGA 
4081 TAAGGCGCAG CGGTCGGGCT GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC 
4141 GACCTACACC GAACTGAGAT ACCTACAGCG TGAGCTATGA GAAAGCGCCA CGCTTCCCGA 
4201 AGGGAGAAAG GCGGACAGGT ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG 
4261 GGAGCTTCCA GGGGGAAACG CCTGGTATCT TTATAGTCCT GTCGGGTTTC GCCACCTCTG
4321 ACTTGAGCGT CGATTTTTGT GATGCTCGTC AGGGGGGCGG AGCCTATGGA AAAACGCCAG
4381 CAACGCGGCC TTTTTACGGT TCCTGGCCTT TTGCTGGCCT TTTGCTCACA TGTTCTTTCC 
4441 TGCGTTATCC CCTGATTCTG TGGATAACCG TATTACCGCC TTTGAGTGAG CTGATACCGC 
4501 TCGCCGCAGC CGAACGACCG AGCGCAGCGA GTCAGTGAGC GAGGAAGCGG AAGA 
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[Schematic; labels recovered from the drawing:]

DNA TO BE AMPLIFIED (5'→3')
attB1 PRIMER
attB2 PRIMER
HYBRID PRIMERS (PART attB, PART GENE SPECIFIC)
DENATURE, ANNEAL HYBRID PRIMERS, EXTEND WITH POLYMERASE
AMPLIFICATION CYCLES
DENATURE, ANNEAL attB PRIMERS, EXTEND WITH POLYMERASE
AMPLIFICATION CYCLES
attB1, attB2

FIG. 89


CMV PROMOTER 




SV40 ori & EARLY PROMOTER 



FIG.90A 
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pDEST28 7141 bp 

ATGCATGTCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCC 
CGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCAT 
TGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTAT 
CATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTAT 
GCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATC 
GCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGAC 
TCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAA
AATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGT 
AGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCTCCCTATCAGTGATAGAGATCTC 
CCTATCAGTGATAGAGATCGTCGACGAGCTCGTTTAGTGAACCGTCAGATCGCCTGGAGA 
CGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCGGACT 
CTAGAGGATCCCTACCGGTGATATCCTCGAGCCCATCAACAAGTTTGTACAAAAAAGCTG 
AACGAGAAACGTAAAATGATATAAATATCAATATATTAAATTAGATTTTGCATAAAAAAC
AGACTACATAATACTGTAAAACACAACATATCCAGTCACTATGGCGGCCGCATTAGGCAC 
CCCAGGCTTTACACTTTATGCTTCCGGCTCGTATAATGTGTGGATTTTGAGTTAGGATCC
GGCGAGATTTTCAGGAGCTAAGGAAGCTAAAATGGAGAAAAAAATCACTGGATATACCAC 
CGTTGATATATCCCAATGGCATCGTAAAGAACATTTTGAGGCATTTCAGTCAGTTGCTCA
ATGTACCTATAACCAGACCGTTCAGCTGGATATTACGGCCTTTTTAAAGACCGTAAAGAA
AAATAAGCACAAGTTTTATCCGGCCTTTATTCACATTCTTGCCCGCCTGATGAATGCTCA
TCCGGAATTCCGTATGGCAATGAAAGACGGTGAGCTGGTGATATGGGATAGTGTTCACCC 
TTGTTACACCGTTTTCCATGAGCAAACTGAAACGTTTTCATCGCTCTGGAGTGAATACCA
CGACGATTTCCGGCAGTTTCTACACATATATTCGCAAGATGTGGCGTGTTACGGTGAAAA 
CCTGGCCTATTTCCCTAAAGGGTTTATTGAGAATATGTTTTTCGTCTCAGCCAATCCCTG
GGTGAGTTTCACCAGTTTTGATTTAAACGTGGCCAATATGGACAACTTCTTCGCCCCCGT 
TTTCACCATGGGCAAATATTATACGCAAGGCGACAAGGTGCTGATGCCGCTGGCGATTCA 
GGTTCATCATGCCGTCTGTGATGGCTTCCATGTCGGCAGAATGCTTAATGAATTACAACA 
GTACTGCGATGAGTGGCAGGGCGGGGCGTAAAGATCTGGATCCGGCTTACTAAAAGCCAG 
ATAACAGTATGCGTATTTGCGCGCTGATTTTTGCGGTATAAGAATATATACTGATATGTA
TACCCGAAGTATGTCAAAAAGAGGTGTGCTATGAAGCAGCGTATTACAGTGACAGTTGAC 
AGCGACAGCTATCAGTTGCTCAAGGCATATATGATGTCAATATCTCCGGTCTGGTAAGCA 
CAACCATGCAGAATGAAGCCCGTCGTCTGCGTGCCGAACGCTGGAAAGCGGAAAATCAGG 
AAGGGATGGCTGAGGTCGCCCGGTTTATTGAAATGAACGGCTCTTTTGCTGACGAGAACA
GGGACTGGTGAAATGCAGTTTAAGGTTTACACCTATAAAAGAGAGAGCCGTTATCGTCTG
TTTGTGGATGTACAGAGTGATATTATTGACACGCCCGGGCGACGGATGGTGATCCCCCTG 
GCCAGTGCACGTCTGCTGTCAGATAAAGTCTCCCGTGAACTTTACCCGGTGGTGCATATC 
GGGGATGAAAGCTGGCGCATGATGACCACCGATATGGCCAGTGTGCCGGTCTCCGTTATC 
GGGGAAGAAGTGGCTGATCTCAGCCACCGCGAAAATGACATCAAAAACGCCATTAACCTG 
ATGTTCTGGGGAATATAAATGTCAGGCTCCCTTATACACAGCCAGTCTGCAGGTCGACCA 
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TAGTGACTGGATATGTTGTGTTTTACAGTATTATGTAGTCTGTTTTTTATGCAAAATCTA
ATTTAATATATTGATATTTATATCATTTTACGTTTCTCGTTCAGCTTTCTTGTACAAAGT
GGTTGATGGGCGGCCGCTCTAGAGGGCCCAAGCTTACGCGTGCATGCGACGTCATAGCTC 
TCTCCCTATAGTGAGTCGTATTATAAGCTAGGCACTGGCCGTCGTTTTACAACGTCGTGA 
CTGGGAAAACTGCTAGCTTGGGATCTTTGTGAAGGAACCTTACTTCTGTGGTGTGACATA 
ATTGGACAAACTACCTACAGAGATTTAAAGCTCTAAGGTAAATATAAAATTTTTAAGTGT
ATAATGTGTTAAACTAGCTGCATATGCTTGCTGCTTGAGAGTTTTGCTTACTGAGTATGA
TTTATGAAAATATTATACACAGGAGCTAGTGATTCTAATTGTTTGTGTATTTTAGATTCA
CAGTCCCAAGGCTCATTTCAGGCCCCTCAGTCCTCACAGTCTGTTCATGATCATAATCAG 
CCATACCACATTTGTAGAGGTTTTACTTGCTTTAAAAAACCTCCCACACCTCCCCCTGAA 
CCTGAAACATAAAATGAATGCAATTGTTGTTGTTAACTTGTTTATTGCAGCTTATAATGG
TTACAAATAAAGCAATAGCATCACAAATTTCACAAATAAAGCATTTTTTTCACTGCATTC
TAGTTGTGGTTTGTCCAAACTCATCAATGTATCTTATCATGTCTGGATCGATCCTGCATT 
AATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGCTGGCGTAATAGCGAAG 
AGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATGGGACGCGC 
CCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACAC 
TTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCG 
CCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTT 
TACGGCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGC 
CCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCT 
TGTTCCAAACTGGAACAACACTCAACCCTATCTCGGTCTATTCTTTTGATTTATAAGGGA
TTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAATATTTAACGCGA
ATTTTAACAAAATATTAACGTTTACAATTTCGCCTGATGCGGTATTTTCTCCTTACGCAT
CTGTGCGGTATTTCACACCGCATACGCGGATCTGCGCAGCACCATGGCCTGAAATAACCT 
CTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAGAACCAGCTGTGGAATGTGT 
GTCAGTTAGGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTATGCAAAGCATGC 
ATCTCAATTAGTCAGCAACCAGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTA 
TGCAAAGCATGCATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTAACTCCGCCCATCC 
CGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCTGACTAATTTTTTTTA
TTTATGCAGAGGCCGAGGCCGCCTCGGCCTCTGAGCTATTCCAGAAGTAGTGAGGAGGCT 
TTTTTGGAGGCCTAGGCTTTTGCAAAAAGCTTGATTCTTCTGACACAACAGTCTCGAACT
TAAGACCATGGCCAAGCCTTTGTCTCAAGAAGAATCCACCCTCATTGAAAGAGCAACGGC 
TACAATCAACAGCATCCCCATCTCTGAAGACTACAGCGTCGCCAGCGCAGCTCTCTCTAG 
CGACGGCCGCATCTTCACTGGTGTCAATGTATATCATTTTACTGGGGGACCTTGTGCAGA 
ACTCGTGGTGCTGGGCACTGCTGCTGCTGCGGCAGCTGGCAACCTGACTTGTATCGTCGC 
GATCGGAAATGAGAACAGGGGCATCTTGAGCCCCTGCGGACGGTGCCGACAGGTGCTTCT 
CGATCTGCATCCTGGGATCAAAGCCATAGTGAAGGACAGTGATGGACAGCCGACGGCAGT 
TGGGATTCGTGAATTGCTGCCCTCTGGTTATGTGTGGGAGGGCTAAGCACTTCGTGGCCG 
AGTTCGAAATGACCGACCAAGCGACGCCCAACCTGCCATCACGATGGCCGCAATAAAATA 
TCTTTATTTTCATTACATCTGTGTGTTGGTTTTTTGTGTGAATCGATAGCGATAAGGATC
CGCGTATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGA 
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CACCCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTAC 
AGACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCG 
AAACGCGCGAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATA
ATAATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATT
TGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAA 
ATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTT
ATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAA
GTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAAC 
AGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTT
AAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGT 
CGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCAT 
CTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAAC 
ACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTG
CACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCC 
ATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAA 
CTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAG 
GCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCT 
GATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGAT 
GGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAA 
CGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGAC 
CAAGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATC
TAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTC
CACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTG
CGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCG 
GATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCA
AATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCG 
CCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCG 
TGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGA 
ACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATAC 
CTACAGCGTGAGCATTGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTAT 
CCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCC 
TGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGA 
TGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTC
CTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTG 
GATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAG 
CGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCCAATACGCAAACCGCCTCTCCCC 
GCGCGTTGGCCGATTCATTAATGCAGAGCTTGCAATTCGCGCGTTTTTCAATATTATTGA
AGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAAT 
AAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACC 
ATTATTATCATGACATTAACCTATAAAAATAGGCGTAGTACGAGGCCCTTTCACTCATTA 
G 
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pDEST29 7156 bp 

ATGCATGTCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCC 
CGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCAT 
TGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTAT 
CATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTAT 
GCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATC 
GCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGAC 
TCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAA
AATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGT 
AGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCTCCCTATCAGTGATAGAGATCTC 
CCTATCAGTGATAGAGATCGTCGACGAGCTCGTTTAGTGAACCGTCAGATCGCCTGGAGA 
CGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCGGACC 
ATGGCGTACTACCATCACCATCACCATCACACCGGTGATATCCTCGAGCCCATCACAAGT 
TTGTACAAAAAAGCTGAACGAGAAACGTAAAATGATATAAATATCAATATATTAAATTAG 
ATTTTGCATAAAAAACAGACTACATAATACTGTAAAACACAACATATCCAGTCACTATGG 
CGGCCGCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATAATGTGTGGA 
TTTTGAGTTAGGATCCGGCGAGATTTTCAGGAGCTAAGGAAGCTAAAATGGAGAAAAAAA 
TCACTGGATATACCACCGTTGATATATCCCAATGGCATCGTAAAGAACATTTTGAGGCAT 
TTCAGTCAGTTGCTCAATGTACCTATAACCAGACCGTTCAGCTGGATATTACGGCCTTTT
TAAAGACCGTAAAGAAAAATAAGCACAAGTTTTATCCGGCCTTTATTCACATTCTTGCCC
GCCTGATGAATGCTCATCCGGAATTCCGTATGGCAATGAAAGACGGTGAGCTGGTGATAT 
GGGATAGTGTTCACCCTTGTTACACCGTTTTCCATGAGCAAACTGAAACGTTTTCATCGC
TCTGGAGTGAATACCACGACGATTTCCGGCAGTTTCTACACATATATTCGCAAGATGTGG
CGTGTTACGGTGAAAACCTGGCCTATTTCCCTAAAGGGTTTATTGAGAATATGTTTTTCG
TCTCAGCCAATCCCTGGGTGAGTTTCACCAGTTTTGATTTAAACGTGGCCAATATGGACA
ACTTCTTCGCCCCCGTTTTCACCATGGGCAAATATTATACGCAAGGCGACAAGGTGCTGA 
TGCCGCTGGCGATTCAGGTTCATCATGCCGTCTGTGATGGCTTCCATGTCGGCAGAATGC 
TTAATGAATTACAACAGTACTGCGATGAGTGGCAGGGCGGGGCGTAAACGCGTGGATCCG 
GCTTACTAAAAGCCAGATAACAGTATGCGTATTTGCGCGCTGATTTTTGCGGTATAAGAA
TATATACTGATATGTATACCCGAAGTATGTCAAAAAGAGGTGTGCTATGAAGCAGCGTAT 
TACAGTGACAGTTGACAGCGACAGCTATCAGTTGCTCAAGGCATATATGATGTCAATATC 
TCCGGTCTGGTAAGCACAACCATGCAGAATGAAGCCCGTCGTCTGCGTGCCGAACGCTGG 
AAAGCGGAAAATCAGGAAGGGATGGCTGAGGTCGCCCGGTTTATTGAAATGAACGGCTCT 
TTTGCTGACGAGAACAGGGACTGGTGAAATGCAGTTTAAGGTTTACACCTATAAAAGAGA
GAGCCGTTATCGTCTGTTTGTGGATGTACAGAGTGATATTATTGACACGCCCGGGCGACG 
GATGGTGATCCCCCTGGCCAGTGCACGTCTGCTGTCAGATAAAGTCTCCCGTGAACTTTA 
CCCGGTGGTGCATATCGGGGATGAAAGCTGGCGCATGATGACCACCGATATGGCCAGTGT 
GCCGGTCTCCGTTATCGGGGAAGAAGTGGCTGATCTCAGCCACCGCGAAAATGACATCAA 
AAACGCCATTAACCTGATGTTCTGGGGAATATAAATGTCAGGCTCCGTTATACACAGCCA 
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GTCTGCAGGTCGACCATAGTGACTGGATATGTTGTGTTTTACAGTATTATGTAGTCTGTT
TTTTATGCAAAATCTAATTTAATATATTGATATTTATATCATTTTACGTTTCTCGTTCAG
CTTTCTTGTACAAAGTGGTGATGGGCGGCCGCTCTAGAGGGCCCAAGCTTACGCGTGCAT 
GCGACGTCATAGCTCTCTCCCTATAGTGAGTCGTATTATAAGCTAGGCACTGGCCGTCGT 
TTTACAACGTCGTGACTGGGAAAACTGCTAGCTTGGGATCTTTGTGAAGGAACCTTACTT
CTGTGGTGTGACATAATTGGACAAACTACCTACAGAGATTTAAAGCTCTAAGGTAAATAT
AAAATTTTTAAGTGTATAATGTGTTAAACTAGCTGCATATGCTTGCTGCTTGAGAGTTTT
GCTTACTGAGTATGATTTATGAAAATATTATACACAGGAGCTAGTGATTCTAATTGTTTG
TGTATTTTAGATTCACAGTCCCAAGGCTCATTTCAGGCCCCTCAGTCCTCACAGTCTGTT 
CATGATCATAATCAGCCATACCACATTTGTAGAGGTTTTACTTGCTTTAAAAAACCTCCC
ACACCTCCCCCTGAACCTGAAACATAAAATGAATGCAATTGTTGTTGTTAACTTGTTTAT
TGCAGCTTATAATGGTTACAAATAAAGCAATAGCATCACAAATTTCACAAATAAAGCATT
TTTTTCACTGCATTCTAGTTGTGGTTTGTCCAAACTCATCAATGTATCTTATCATGTCTG 
GATCGATCCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGCT 
GGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATG 
GCGAATGGGACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCA 
GCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCT 
TTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGT
TCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCAC 
GTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCT 
TTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGGTCTATTCTT
TTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAAC
AAATATTTAACGCGAATTTTAACAAAATATTAACGTTTACAATTTCGCCTGATGCGGTAT
TTTCTCCTTACGCATCTGTGCGGTATTTCACACCGCATACGCGGATCTGCGCAGCACCAT 
GGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAGAACC 
AGCTGTGGAATGTGTGTCAGTTAGGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAA 
GTATGCAAAGCATGCATCTCAATTAGTCAGCAACCAGGTGTGGAAAGTCCCCAGGCTCCC 
CAGCAGGCAGAAGTATGCAAAGCATGCATCTCAATTAGTCAGCAACCATAGTCCCGCCCC 
TAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCT
GACTAATTTTTTTTATTTATGCAGAGGCCGAGGCCGCCTCGGCCTCTGAGCTATTCCAGA
AGTAGTGAGGAGGCTTTTTTGGAGGCCTAGGCTTTTGCAAAAAGCTTGATTCTTCTGACA 
CAACAGTCTCGAACTTAAGACCATGGCCAAGCCTTTGTCTCAAGAAGAATCCACCCTCAT 
TGAAAGAGCAACGGCTACAATCAACAGCATCCCCATCTCTGAAGACTACAGCGTCGCCAG 
CGCAGCTCTCTCTAGCGACGGCCGCATCTTCACTGGTGTCAATGTATATCATTTTACTGG 
GGGACCTTGTGCAGAACTCGTGGTGCTGGGCACTGCTGCTGCTGCGGCAGCTGGCAACCT 
GACTTGTATCGTCGCGATCGGAAATGAGAACAGGGGCATCTTGAGCCCCTGCGGACGGTG 
CCGACAGGTGCTTCTCGATCTGCATCCTGGGATCAAAGCCATAGTGAAGGACAGTGATGG 
ACAGCCGACGGCAGTTGGGATTCGTGAATTGCTGCCCTCTGGTTATGTGTGGGAGGGCTA 
AGCACTTCGTGGCCGAGTTCGAAATGACCGACCAAGCGACGCCCAACCTGCCATCACGAT 
GGCCGCAATAAAATATCTTTATTTTCATTACATCTGTGTGTTGGTTTTTTGTGTGAATCG
ATAGCGATAAGGATCCGCGTATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAG 
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TTAAGCCAGCCCCGACACCCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTC 

CCGGCATCCGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTT 

TCACCGTCATCACCGAAACGCGCGAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAG 

GTTAATGTCATGATAATAATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTG

CGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGA

CAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACAT 

TTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCA

GAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATC 

GAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCA

ATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGG 

CAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCA

GTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATA 

ACCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAG 

CTAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCG

GAGCTGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCA 

ACAACGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTA 

ATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCT 

GGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCA 

GCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAG 

GCAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCAT 

TGGTAACTGTCAGACCAAGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTT

TAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAA

CGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGA 

GATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCG 

GTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGC

AGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAG 

AACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCC 

AGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCG 

CAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTAC 

ACCGAACTGAGATACCTACAGCGTGAGCATTGAGAAAGCGCCACGCTTCCCGAAGGGAGA 

AAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTT 

CCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAG 

CGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCG 

GCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTA 

TCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGC 

AGCCGAACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCCAATACGC 

AAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGAGCTTGCAATTCGCGCGTT 

TTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAA

TGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCT 

GACGTCTAAGAAACCATTATTATCATGACATTAACCTATAAAAATAGGCGTAGTACGAGG 

CCCTTTCACTCATTAG 
pDEST30 7544 bp

ATGCATGTCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCC 
CGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCAT 
TGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTAT 
CATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTAT 
GCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATC 
GCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGAC 
TCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAA
AATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGT 
AGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCTCCCTATCAGTGATAGAGATCTC 
CCTATCAGTGATAGAGATCGTCGACGAGCTCGTTTAGTGAACCGTCAGATCGCCTGGAGA 
CGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCGGACT 
CTAGAGGATCCCTACCGGTGATATCCTCGAGCCCATCAACAAGTTTGTACAAAAAAGCTG 
AACGAGAAACGTAAAATGATATAAATATCAATATATTAAATTAGATTTTGCATAAAAAAC
AGACTACATAATACTGTAAAACACAACATATCCAGTCACTATGGCGGCCGCATTAGGCAC 
CCCAGGCTTTACACTTTATGCTTCCGGCTCGTATAATGTGTGGATTTTGAGTTAGGATCC
GGCGAGATTTTCAGGAGCTAAGGAAGCTAAAATGGAGAAAAAAATCACTGGATATACCAC 
CGTTGATATATCCCAATGGCATCGTAAAGAACATTTTGAGGCATTTCAGTCAGTTGCTCA
ATGTACCTATAACCAGACCGTTCAGCTGGATATTACGGCCTTTTTAAAGACCGTAAAGAA
AAATAAGCACAAGTTTTATCCGGCCTTTATTCACATTCTTGCCCGCCTGATGAATGCTCA
TCCGGAATTCCGTATGGCAATGAAAGACGGTGAGCTGGTGATATGGGATAGTGTTCACCC
TTGTTACACCGTTTTCCATGAGCAAACTGAAACGTTTTCATCGCTCTGGAGTGAATACCA 
CGACGATTTCCGGCAGTTTCTACACATATATTCGCAAGATGTGGCGTGTTACGGTGAAAA 
CCTGGCCTATTTCCCTAAAGGGTTTATTGAGAATATGTTTTTCGTCTCAGCCAATCCCTG
GGTGAGTTTCACCAGTTTTGATTTAAACGTGGCCAATATGGACAACTTCTTCGCCCCCGT 
TTTCACCATGGGCAAATATTATACGCAAGGCGACAAGGTGCTGATGCCGCTGGCGATTCA 
GGTTCATCATGCCGTCTGTGATGGCTTCCATGTCGGCAGAATGCTTAATGAATTACAACA 
GTACTGCGATGAGTGGCAGGGCGGGGCGTAAAGATCTGGATCCGGCTTAGTAAAAGCCAG 
ATAACAGTATGCGTATTTGCGCGCTGATTTTTGCGGTATAAGAATATATACTGATATGTA
TACCCGAAGTATGTCAAAAAGAGGTGTGCTATGAAGCAGCGTATTACAGTGACAGTTGAC 
AGCGACAGCTATCAGTTGCTCAAGGCATATATGATGTCAATATCTCCGGTCTGGTAAGCA 
CAACCATGCAGAATGAAGCCCGTCGTCTGCGTGCCGAACGCTGGAAAGCGGAAAATCAGG 
AAGGGATGGCTGAGGTCGCCCGGTTTATTGAAATGAACGGCTCTTTTGCTGACGAGAACA
GGGACTGGTGAAATGCAGTTTAAGGTTTACACCTATAAAAGAGAGAGCCGTTATCGTCTG
TTTGTGGATGTACAGAGTGATATTATTGACACGCCCGGGCGACGGATGGTGATCCCCCTG 
GCCAGTGCACGTCTGCTGTCAGATAAAGTCTCCCGTGAACTTTACCCGGTGGTGCATATC 
GGGGATGAAAGCTGGCGCATGATGACCACCGATATGGCCAGTGTGCCGGTCTCCGTTATC 
GGGGAAGAAGTGGCTGATCTCAGCCACCGCGAAAATGACATCAAAAACGCCATTAACCTG 
ATGTTCTGGGGAATATAAATGTCAGGCTCCCTTATACACAGCCAGTCTGCAGGTCGACCA 
TAGTGACTGGATATGTTGTGTTTTACAGTATTATGTAGTCTGTTTTTTATGCAAAATCTA
ATTTAATATATTGATATTTATATCATTTTACGTTTCTCGTTCAGCTTTCTTGTACAAAGT
GGTTGATGGGCGGCCGCTCTAGAGGGCCCAAGCTTACGCGTGCATGCGACGTCATAGCTC 
TCTCCCTATAGTGAGTCGTATTATAAGCTAGGCACTGGCCGTCGTTTTACAACGTCGTGA 
CTGGGAAAACTGCTAGCTTGGGATCTTTGTGAAGGAACCTTACTTCTGTGGTGTGACATA 
ATTGGACAAACTACCTACAGAGATTTAAAGCTCTAAGGTAAATATAAAATTTTTAAGTGT
ATAATGTGTTAAACTAGCTGCATATGCTTGCTGCTTGAGAGTTTTGCTTACTGAGTATGA
TTTATGAAAATATTATACACAGGAGCTAGTGATTCTAATTGTTTGTGTATTTTAGATTCA
CAGTCCCAAGGCTCATTTCAGGCCCCTCAGTCCTCACAGTCTGTTCATGATCATAATCAG 
CCATACCACATTTGTAGAGGTTTTACTTGCTTTAAAAAACCTCCCACACCTCCCCCTGAA 
CCTGAAACATAAAATGAATGCAATTGTTGTTGTTAACTTGTTTATTGCAGCTTATAATGG
TTACAAATAAAGCAATAGCATCACAAATTTCACAAATAAAGCATTTTTTTCACTGCATTC
TAGTTGTGGTTTGTCCAAACTCATCAATGTATCTTATCATGTCTGGATCGATCCTGCATT 
AATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGCTGGCGTAATAGCGAAG 
AGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATGGGACGCGC 
CCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACAC 
TTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCG 
CCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTT 
TACGGCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGC
CCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCT 
TGTTCCAAACTGGAACAACACTCAACCCTATCTCGGTCTATTCTTTTGATTTATAAGGGA
TTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAATATTTAACGCGA
ATTTTAACAAAATATTAACGTTTACAATTTCGCCTGATGCGGTATTTTCTCCTTACGCAT
CTGTGCGGTATTTCACACCGCATACGCGGATCTGCGCAGCACCATGGCCTGAAATAACCT 
CTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAGAACCAGCTGTGGAATGTGT 
GTCAGTTAGGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTATGCAAAGCATGC 
ATCTCAATTAGTCAGCAACCAGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTA 
TGCAAAGCATGCATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTAACTCCGCCCATCC 
CGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCTGACTAATTTTTTTTA
TTTATGCAGAGGCCGAGGCCGCCTCGGCCTCTGAGCTATTCCAGAAGTAGTGAGGAGGCT
TTTTTGGAGGCCTAGGCTTTTGCAAAAAGCTTGATTCTTCTGACACAACAGTCTCGAACT
TAAGGCTAGAGCCACCATGATTGAACAAGATGGATTGCACGCAGGTTCTCCGGCCGCTTG 
GGTGGAGAGGCTATTCGGCTATGACTGGGCACAACAGACAATCGGCTGCTCTGATGCCGC 
CGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAAGACCGACCTGTCCGG 
TGCCCTGAATGAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGT 
TCCTTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACTGGCTGCTATTGGG 
CGAAGTGCCGGGGCAGGATCTCCTGTCATCTCACCTTGCTCCTGCCGAGAAAGTATCCAT 
CATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTACCTGCCCATTCGACCA 
CCAAGCGAAACATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCA 
GGATGATCTGGACGAAGAGCATCAGGGGCTCGCGCCAGCCGAACTGTTCGCCAGGCTCAA 
GGCGCGCATGCCCGACGGCGAGGATCTCGTCGTGACCCATGGCGATGCCTGCTTGCCGAA 
TATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTGGCCGGCTGGGTGTGGC 
GGACCGCTATCAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGA 
ATGGGCTGACCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGC 
CTTCTATCGCCTTCTTGACGAGTTCTTCTGAGCGGGACTCTGGGGTTCGAAATGACCGAC 
CAAGCGACGCCCAACCTGCCATCACGATGGCCGCAATAAAATATCTTTATTTTCATTACA
TCTGTGTGTTGGTTTTTTGTGTGAATCGATAGCGATAAGGATCCGCGTATGGTGCACTCT 
CAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGACACCCGCCAACACCCGC 
TGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGT 
CTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCGAAACGCGCGAGACGAAA
GGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAATAATGGTTTCTTAGAC 



FIG.92C 






GTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAAT
ACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATATTG 
AAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGC
ATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGA
TCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGA
GAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGG
CGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTC 
TCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGAC 
AGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTACT 
TCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCACAACATGGGGGATCA
TGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAGCG 
TGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACTATTAACTGGCGAACT 
ACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGG 
ACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGG 
TGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTAT 
CGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGC 
TGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATAT
ACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTT
TGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCC
CGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTT
GCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAAC
TCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGT 
GTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCT 
GCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGA 
CTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCAC 
ACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCATTG 
AGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGT 
CGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCC 
TGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCG 
GAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCC
TTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGC 
CTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAG 
CGAGGAAGCGGAAGAGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCA 
TTAATGCAGAGCTTGCAATTCGCGCGTTTTTCAATATTATTGAAGCATTTATCAGGGTTA
TTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCC 
GCGCACATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCATTATTATCATGACATT 
AACCTATAAAAATAGGCGTAGTACGAGGCCCTTTCACTCATTAG 
pDEST31 7559 bp

atgcatgtcgttacataacttacggtaaatggcccgcctggctgaccgcccaacgacccc 

cgcccattgacgtcaataatgacgtatgttcccatagtaacgccaatagggactttccat 

tgacgtcaatgggtggagtatttacggtaaactgcccacttggcagtacatcaagtgtat 

catatgccaagtacgccccctattgacgtcaatgacggtaaatggcccgcctggcattat

gcccagtacatgaccttatgggactttcctacttggcagtacatctacgtattagtcatc 

gctattaccatggtgatgcggttttggcagtacatcaatgggcgtggatagcggtttgac

tcacggggatttccaagtctccaccccattgacgtcaatgggagtttgttttggcaccaa

aatcaacgggactttccaaaatgtcgtaacaactccgccccattgacgcaaatgggcggt 

aggcgtgtacggtgggaggtctatataagcagagctctccctatcagtgatagagatctc 

cctatcagtgatagagatcgtcgacgagctcgtttagtgaaccgtcagatcgcctggaga 

cgccatccacgctgttttgacctccatagaagacaccgggaccgatccagcctccggacc 

atggcgtactaccatcaccatcaccatcacaccggtgatatcctcgagcccatcacaagt 

ttgtacaaaaaagctgaacgagaaacgtaaaatgatataaatatcaatatattaaattag 

attttgcataaaaaacagactacataatactgtaaaacacaacatatccagtcactatgg 

cggccgcattaggcaccccaggctttacactttatgcttccggctcgtataatgtgtgga 

ttttgagttaggatccggcgagattttcaggagctaaggaagctaaaatggagaaaaaaa 

tcactggatataccaccgttgatatatcccaatggcatcgtaaagaacattttgaggcat 

ttcagtcagttgctcaatgtacctataaccagaccgttcagctggatattacggcctttt

taaagaccgtaaagaaaaataagcacaagttttatccggcctttattcacattcttgccc

gcctgatgaatgctcatccggaattccgtatggcaatgaaagacggtgagctggtgatat 

gggatagtgttcacccttgttacaccgttttccatgagcaaactgaaacgttttcatcgc

tctggagtgaataccacgacgatttccggcagtttctacacatatattcgcaagatgtgg 

cgtgttacggtgaaaacctggcctatttccctaaagggtttattgagaatatgtttttcg

tctcagccaatccctgggtgagtttcaccagttttgatttaaacgtggccaatatggaca

acttcttcgcccccgttttcaccatgggcaaatattatacgcaaggcgacaaggtgctga 

tgccgctggcgattcaggttcatcatgccgtctgtgatggcttccatgtcggcagaatgc 

ttaatgaattacaacagtactgcgatgagtggcagggcggggcgtaaacgcgtggatccg 

gcttactaaaagccagataacagtatgcgtatttgcgcgctgatttttgcggtataagaa

tatatactgatatgtatacccgaagtatgtcaaaaagaggtgtgctatgaagcagcgtat 

tacagtgacagttgacagcgacagctatcagttgctcaaggcatatatgatgtcaatatc 

tccggtctggtaagcacaaccatgcagaatgaagcccgtcgtctgcgtgccgaacgctgg 

aaagcggaaaatcaggaagggatggctgaggtcgcccggtttattgaaatgaacggctct 

tttgctgacgagaacagggactggtgaaatgcagtttaaggtttacacctataaaagaga

gagccgttatcgtctgtttgtggatgtacagagtgatattattgacacgcccgggcgacg 

gatggtgatccccctggccagtgcacgtctgctgtcagataaagtctcccgtgaacttta 

cccggtggtgcatatcggggatgaaagctggcgcatgatgaccaccgatatggccagtgt 

gccggtctccgttatcggggaagaagtggctgatctcagccaccgcgaaaatgacatcaa 

aaacgccattaacctgatgttctggggaatataaatgtcaggctccgttatacacagcca 

gtctgcaggtcgaccatagtgactggatatgttgtgttttacagtattatgtagtctgtt

ttttatgcaaaatctaatttaatatattgatatttatatcattttacgtttctcgttcag

ctttcttgtacaaagtggtgatgggcggccgctctagagggcccaagcttacgcgtgcat 

gcgacgtcatagctctctccctatagtgagtcgtattataagctaggcactggccgtcgt 

tttacaacgtcgtgactgggaaaactgctagcttgggatctttgtgaaggaaccttactt 
CTGTGGTGTGACATAATTGGACAAACTACCTACAGAGATTTAAAGCTCTAAGGTAAATAT
AAAATTTTTAAGTGTATAATGTGTTAAACTAGCTGCATATGCTTGCTGCTTGAGAGTTTT
GCTTACTGAGTATGATTTATGAAAATATTATACACAGGAGCTAGTGATTCTAATTGTTTG
TGTATTTTAGATTCACAGTCCCAAGGCTCATTTCAGGCCCCTCAGTCCTCACAGTCTGTT 
CATGATCATAATCAGCCATACCACATTTGTAGAGGTTTTACTTGCTTTAAAAAACCTCCC
ACACCTCCCCCTGAACCTGAAACATAAAATGAATGCAATTGTTGTTGTTAACTTGTTTAT
TGCAGCTTATAATGGTTACAAATAAAGCAATAGCATCACAAATTTCACAAATAAAGCATT
TTTTTCACTGCATTCTAGTTGTGGTTTGTCCAAACTCATCAATGTATCTTATCATGTCTG 
GATCGATCCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGCT 
GGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATG 
GCGAATGGGACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCA 
GCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCT 
TTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGT 
TCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCAC 
GTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCT 
TTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGGTCTATTCTT 
TTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAAC
AAATATTTAACGCGAATTTTAACAAAATATTAACGTTTACAATTTCGCCTGATGCGGTAT
TTTCTCCTTACGCATCTGTGCGGTATTTCACACCGCATACGCGGATCTGCGCAGCACCAT 
GGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAGAACC 
AGCTGTGGAATGTGTGTCAGTTAGGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAA 
GTATGCAAAGCATGCATCTCAATTAGTCAGCAACCAGGTGTGGAAAGTCCCCAGGCTCCC 
CAGCAGGCAGAAGTATGCAAAGCATGCATCTCAATTAGTCAGCAACCATAGTCCCGCCCC 
TAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCT 
GACTAATTTTTTTTATTTATGCAGAGGCCGAGGCCGCCTCGGCCTCTGAGCTATTCCAGA
AGTAGTGAGGAGGCTTTTTTGGAGGCCTAGGCTTTTGCAAAAAGCTTGATTCTTCTGACA
CAACAGTCTCGAACTTAAGGCTAGAGCCACCATGATTGAACAAGATGGATTGCACGCAGG 
TTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGACTGGGCACAACAGACAATCGG 
CTGCTCTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAA
GACCGACCTGTCCGGTGCCCTGAATGAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCT 
GGCCACGACGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGA 
CTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCCTGTCATCTCACCTTGCTCCTGC 
CGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTAC 
CTGCCCATTCGACCACCAAGCGAAACATCGCATCGAGCGAGCACGTACTCGGATGGAAGC 
CGGTCTTGTCGATCAGGATGATCTGGACGAAGAGCATCAGGGGCTCGCGCCAGCCGAACT 
GTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCGAGGATCTCGTCGTGACCCATGGCGA 
TGCCTGCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTGG 
CCGGCTGGGTGTGGCGGACCGCTATCAGGACATAGCGTTGGCTACCCGTGATATTGCTGA 
AGAGCTTGGCGGCGAATGGGCTGACCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCCGA 
TTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAGTTCTTCTGAGCGGGACTCTGGGG 
TTCGAAATGACCGACCAAGCGACGCCCAACCTGCCATCACGATGGCCGCAATAAAATATC
TTTATTTTCATTACATCTGTGTGTTGGTTTTTTGTGTGAATCGATAGCGATAAGGATCCG
CGTATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGACA 
CCCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAG 
ACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCGAA 
ACGCGCGAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAAT 
AATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTG

TTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAAT 

GCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTAT

TCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGT

AAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAG 

CGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAA

AGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCG 

CCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCT 

TACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACAC

TGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCA

CAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCAT 

ACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACT 

ATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGC 

GGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGA 

TAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGG 

TAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACG 

AAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCA

AGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTA

GGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCA

CTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCG

CGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGA 

TCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAA 

TACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCC 

TACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTG 

TCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAAC 

GGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCT 

ACAGCGTGAGCATTGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCC 

GGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTG

GTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATG

CTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCT

GGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGA 

TAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCG 

CAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCCAATACGCAAACCGCCTCTCCCCGC 

GCGTTGGCCGATTCATTAATGCAGAGCTTGCAATTCGCGCGTTTTTCAATATTATTGAAG

CATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAA 

ACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCAT 

TATTATCATGACATTAACCTATAAAAATAGGCGTAGTACGAGGCCCTTTCACTCATTAG 
pDEST32 12288 bp

GACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAATAATGGTTT
CTTAGGACGGATCGCTTGCCTGTAACTTACACGCGCCTCGTATCTTTTAATGATGGAATA 
ATTTGGGAATTTACTCTGTGTTTATTTATTTTTATGTTTTGTATTTGGATTTTAGAAAGT

aaataaagaaggtagaagagttacggaatgaagaaaaaaaaataaacaaaggtttaaaaa

atttcaacaaaaagcgtactttacatatatatttattagacaagaaaagcagattaaata

gatatacattcgattaacgataagtaaaatgtaaaatcacaggattttcgtgtgtggtct

tctacacagacaagatgaaacaattcggcattaatacctgagagcaggaagagcaagata 

aaaggtagtatttgttggcgatccccctagagtcttttacatcttcggaaaacaaaaact

attttttctttaatttctttttttactttctatttttaatttatatatttatattaaaaa

atttaaattataattatttttatagcacgtgatgaaaaggacccaggtggcacttttcgg

ggaaatgtgcgcggaacccctatttgtttatttttctaaatacattcaaatatgtatccg

ctcatgagacaataaccctgataaatgcttcaataatctgcagtgcgcagggcccgtgtc 

tcaaaatctctgatgttacattgcacaagataaaaatatatcatcatgaacaataaaact

gtctgcttacataaacagtaatacaaggggtgttatgagccatattcaacgggaaacgtc 

ttgctggaggccgcgattaaattccaacatggatgctgatttatatgggtataaatgggc

tcggtagccaaccactagaactatagctagagtcctgggcgaacaaacgatgctcgcctt 

ccagaaaaccgaggatgcgaaccacttcatccggggtcagcaccaccggcaagcgccgcg 

acggccgaggtcttccgatctcctgaagccagggcagatccgtgcacagcaccttgccgt 

agaagaacagcaaggccgccaatgcctgacgatgcgtggagaccgaaaccttgcgctcgt 

tcgccagccaggacagaaatgcctcgacttcgctgctgcccaaggttgccgggtgacgca 

caccgtggaaacggatgaaggcacgaacccagttgacataagcctgttcggttcgtaaac 

tgtaatgcaagtagcgtatgcgctcacgcaactggtccagaaccttgaccgaacgcagcg 

gtggtaacggcgcagtggcggttttcatggcttgttatgactgtttttttgtacagtcta

tgcctcgggcatccaagcagcaagcgcgttacgccgtgggtcgatgtttgatgttatgga 

gcagcaacgatgttacgcagcagcaacgatgttacgcagcagggcagtcgccctaaaaca 

aagttaggtggctcaagtatgggcatcattcgcacatgtaggctcggccctgaccaagtc 

aaatccatgcgggctgctcttgatcttttcggtcgtgagttcggagacgtagccacctac

tcccaacatcagccggactccgattacctcgggaacttgctccgtagtaagacattcatc 

gcgcttgctgccttcgaccaagaagcggttgttggcgctctcgcggcttacgttctgccc 

aggtttgagcagccgcgtagtgagatctatatctatgatctcgcagtctccggcgagcac 

cggaggcagggcattgccaccgcgctcatcaatctcctcaagcatgaggccaacgcgctt 

ggtgcttatgtgatctacgtgcaagcagattacggtgacgatcccgcagtggctctctat 

acaaagttgggcatacgggaagaagtgatgcactttgatatcgacccaagtaccgccacc 

taacaattcgttcaagccgagatcggcttcccggcctaataggttgtattgatgttggac 

gagtcggaatcgcagaccgataccaggatcttgccatcctatggaactgcctcggtgagt 

tttctccttcattacagaaacggctttttcaaaaatatggtattgataatcctgatatga

ataaattgcagtttcatttgatgctcgatgagtttttctaatcagaattggttaattggt

tgtaacactggcagagcattacgctgacttgacgggacggcgncatgaccaaaatccctt 

aacgtgagttttcgttccactgagcgtcagaccccgtagaaaagatcaaaggatcttctt

GAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAG 
CGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCA

GCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCA 

AGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTG 

CCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGG 

CGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCT 

ACACCGAACTGAGATACCTACAGCGTGAGCATTGAGAAAGCGCCACGCTTCCCGAAGGGA

GAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGC 

TTCCAGGGGGGAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTG

AGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCCGAGCCTATGGAAAAACGCCAGCAACG 

CGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGT 

TATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCC 

GCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCCAATAC 

GCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGCACGACAGGTTTC 

CCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTACCTCACTCATTAGG 

CACCCCAGGCTTTACACTTTATGCTTCCGGCTCCTATGTTGTGTGGAATTGTGAGCGGAT 

AACAATTTCACACAGGAAACAGCTATGACCATGATTACGCCAAGCTCGGAATTAACCCTC 

ACTAAAGGGAACAAAAGCTGGTACCGATCCCGAGCTTTGCAAATTAAAGCCTTCGAGCGT 

CCCAAAACCTTCTCAAGCAAGGTTTTCAGTATAATGTTACATGCGTACACGCGTCTGTAC 

AGAAAAAAAAGAAAAATTTGAAATATAAATAACGTTCTTAATACTAACATAACTATAAAA 

AAATAAATAGGGACCTAGACTTCAGGTTGTCTAACTCCTTCCTTTTCGGTTAGAGCGGAT

GTGGGGGGAGGGCGTGAATGTAAGCGTGACATAACTAATTACATGATATCGACAAAGGAA 

AAGGGGCCTGTTTACTCACAGGCTTTTTTCAAGTAGGTAATTAAGTCGTTTCTGTCTTTT

TCCTTCTTCAACCCACCAAAGGCCATCTTG GTAC I I I I I I I I I I I I I I I I I I I ) I I | | | | 

■ " l l I I I I I I I I I I I I I I | | | | | | | | | M | | | | | | | | | | | | i | | | | | | | | | | | | | | | | | 

I I I I I I I I CATAGAAATAATACAGAAGTAGATGTTGAATTAGATTAAACTGAAGATATAT 
AATTTATTGGAAAATACATAGAGCTTTTTGTTGATGCGCTTAAGCGATCAATTCAACAAC
ACCACCAGCAGCTCTGATTTTTTCTTCAGCCAACTTGGAGACGAATCTAGCTTTGACGAT
AACTGGAACATTTGGAATTCTACCCTTACCCAAGATCTTACCGTAACCGGCTGCCAAAGT 
GTCAATAACTGGAGCAGTTTCCTTAGAAGCAGATTTCAAGTATTGGTCTCTCTTGTCTTC
TGGGATCAATGTCCACAATTTGTCCAAGTTCAAGACTGGCTTCCAGAAATGAGCTTGTTG
CTTGTGGAAGTATCTCATACCAACCTTACCGAAATAACCTGGATGGTATTTATCCATGTT 
AATTCTGTGGTGATGTTGACCACCGGCCATACCTCTACCACCGGGGTGCTTTCTGTGCTT 
ACCGATACGACCTTTACCGGCTGAGACGTGACCTCTGTGCTTTCTAGTCTTAGTGAATCT 
GGAAGGCATTCTTGATTAGTTGGATGATTGTTCTGGGATTTAATGCAAAAATCACTTAAG
AAGGAAAATCAACGGAGAAAGCAAACGCCATCTTAAATATACGGGATACAGATGAAAGGG
TTTGAACCTATCTGGAAAATAGCATTAAACAAGCGAAAAACTGCGAGGAAAATTGTTTGC 
GTCTCTGCGGGCTATTCACGCGCCAGAGGAAAATAGGAAAAATAACAGGGCATTAGAAAA 
ATAATTTTGATTTTGGTAATGTGTGGGTCCTGGTGTACAGATGTTACATTGGTTACAGTA
CTCTTGTTTTTGCTGTGTTTTTCGATGAATCTCCAAAATGGTTGTTAGCACATGGAAGAG

tcacc(^tgctmg™tctctatgtmgctacgtggcgtgacttttgatgaagccgcac 

AAGAGATACAGGATTGGCAACTGCAAATAGAATCTGGGGATCCCCCCTCGAGATCCGGGA 

tcgaagaaatgatggtaaatgaaataggaaatcaaggagcatgaaggcaaaagacaaata 
taagggtcgaacgaaaaataaagtgaaaagtgttgatatgatgtatttggctttgcggcg
CCGAAAAAACGAGTTTACGCAATTGCACAATCATGCTGACTCTGTGGCGGACCCGCGCTC

TTGCCGGCCCGGCGATAACGCTGGGCGTGAGGCTGTGCCCGGCGGAGTTTTTTGCGCCTG

CATTTTCCAAGGTTTACCCTGCGCTAAGGGGCGAGATTGGAGAAGCAATAAGAATGCCGG 

TTGGGGTTGCGATGATGACGACCACGACAACTGGTGTCATTATTTAAGTTGCCGAAAGAA 

CCTGAGTGCATTTGCAACATGAGTATACTAGAAGAATGAGCCAAGACTTGCGAGACGCGA 

GTTTGCCGGTGGTGCGAACAATAGAGCGACCATGACCTTGAAGGTGAGACGCGCATAACC 

GCTAGAGTACTTTGAAGAGGAAACAGCAATAGGGTTGCTACCAGTATAAATAGACAGGTA 

CATACAACACTGGAAATGGTTGTCTGTTTGAGTACGCTTTCAATTCATTTGGGTGTGCAC

TTTATTATGTTACAATATGGAAGGGAACTTTACACTTCTCCTATGCACATATATTAATTA

AAGTCCAATGCTAGTAGAGAAGGGGGGTAACACCCCTCCGCGCTCTTTTCCGATTTTTTT

CTAAACCGTGGAATATTTCGGATATCCTTTTGTTGTTTCCGGGTGTACAATATGGACTTC

CTCTTTTCTGGCAACCAAACCCATACATCGGGATTCCTATAATACCTTCGTTGGTCTCCC 

TAACATGTAGGTGGCGGAGGGGAGATATACAATAGAACAGATACCAGACAAGACATAATG 

GGCTAAACAAGACTACACCAATTACACTGCCTCATTGATGGTGGTACATAACGAACTAAT 

ACTGTAGCCCTAGACTTGATAGCCATCATCATATCGAAGTTTCACTACCCTTTTTCCATT

TGCCATCTATTGAAGTAATAATAGGCGCATGCAACTTCTTTTCTTTTTTTTTCTTTTCTC

TCTCCCCCGTTGTTGTCTCACCATATCCGCAATGACAAAAAAAATGATGGAAGACACTAA 

AGGAAAAAATTAACGACAAAGACAGCACCAACAGATGTCGTTGTTCCAGAGCTGATGAGG 

GGTATCTTCGAACACACGAAACTTTTTCCTTCCTTCATTCACGCACACTACTCTCTAATG

AGCAACGGTATACGGCCTTCCTTCCAGTTACTTGAATTTGAAATAAAAAAAGTTTGCCGC

TTTGCTATCAAGTATAAATAGACCTGCAATTATTAATCTTTTGTTTCCTCGTCATTGTTC

TCGTTCCCTTTCTTCCTTGTTTCTTTTTCTGCACAATATTTCAAGCTATACCAAGCATAC 

AATCAACTCCAAGCTTGAAGCAAGCCTCCTGAAAGATGAAGCTACTGTCTTCTATCGAAC 

AAGCATGCGATATTTGCCGACTTAAAAAGCTCAAGTGCTCCAAAGAAAAACCGAAGTGCG 

CCAAGTGTCTGAAGAACAACTGGGAGTGTCGCTACTCTCCCAAAACCAAAAGGTCTCCGC 

TGACTAGGGCACATCTGACAGAAGTGGAATCAAGGCTAGAAAGACTGGAACAGCTATTTC

TACTGATTTTTCCTCGAGAAGACCTTGACATGATTTTGAAAATGGATTCTTTACAGGATA

TAAAAGCATTGTTAACAGGATTATTTGTACAAGATAATGTGAATAAAGATGCCGTCACAG

ATAGATTGGCTTCAGTGGAGACTGATATGCCTCTAACATTGAGACAGCATAGAATAAGTG 

CGACATCATCATCGGAAGAGAGTAGTAACAAAGGTCAAAGACAGTTGACTGTATCGTCGA 

GGTCGAATCAAACAAGTTTGTACAAAAAAGCTGAACGAGAAACGTAAAATGATATAAATA 

TCAATATATTAAATTAGATTTTGCATAAAAAACAGACTACATAATACTGTAAAACACAAC

ATATCCAGTCACTATGGCGGCCGCTAAGTTGGCAGCATCACCCGACGCACTTTGCGCCGA 

ATAAATACCTGTGACGGAAGATCACTTCGCAGAATAAATAAATCCTGGTGTCCCTGTTGA 

TACCGGGAAGCCCTGGGCCAACTTTTGGCGAAAATGAGACGTTGATCGGCACGTAAGAGG 

TTCCAACTTTCACCATAATGAAATAAGATCACTACCGGGCGTATTTTTTGAGTTATCGAG

ATTTTCAGGAGCTAAGGAAGCTAAAATGGAGAAAAAAATCACTGGATATACCACCGTTGA 

TATATCCCAATGGCATCGTAAAGAACATTTTGAGGCATTTCAGTCAGTTGCTCAATGTAC

CTATAACCAGACCGTTCAGCTGGATATTACGGCCTTTTTAAAGACCGTAAAGAAAAATAA

GCACAAGTTTTATCCGGCCTTTATTCACATTCTTGCCCGCCTGATGAATGCTCATCCGGA

ATTCCGTATGGCAATGAAAGACGGTGAGCTGGTGATATGGGATAGTGTTCACCCTTGTTA

CACCGTTTTCCATGAGCAAACTGAAACGTTTTCATCGCTCTGGAGTGAATACCACGACGA

TTTCCGGCAGTTTCTACACATATATTCGCAAGATGTGGCGTGTTACGGTGAAAACCTGGC 
CTATTTCCCTAAAGGGTTTATTGAGAATATGTTTTTCGTCTCAGCCAATCCCTGGGTGAG
TTTCACCAGTTTTGATTTAAACGTGGCCAATATGGACAACTTCTTCGCCCCCGTTTTCAC
CATGGGCAAATATTATACGCAAGGCGACAAGGTGCTGATGCCGCTGGCGATTCAGGTTCA 
TCATGCCGTCTGTGATGGCTTCCATGTCGGCAGAATGCTTAATGAATTACAACAGTACTG
CGATGAGTGGCAGGGCGGGGCGTAATCTAGAGGATCCGGCTTACTAAAAGCCAGATAACA 
GTATGCGTATTTGCGCGCTGATTTTTGCGGTATAAGAATATATACTGATATGTATACCCG 
AAGTATGTCAAAAAGAGGTGTGCTATGAAGCAGCGTATTACAGTGACAGTTGACAGCGAC 
AGCTATCAGTTGCTCAAGGCATATATGATGTCAATATCTCCGGTCTGGTAAGCACAACCA 
TGCAGAATGAAGCCCGTCGTCTGCGTGCCGAACGCTGGAAAGCGGAAAATCAGGAAGGGA 
TGGCTGAGGTCGCCCGGTTTATTGAAATGAACGGCTCTTTTGCTGACGAGAACAGGGACT
GGTGAAATGCAGTTTAAGGTTTACACCTATAAAAGAGAGAGCCGTTATCGTCTGTTTGTG
GATGTACAGAGTGATATTATTGACACGCCCGGGCGACGGATGGTGATCCCCCTGGCCAGT 
GCACGTCTGCTGTCAGATAAAGTCTCCCGTGAACTTTACCCGGTGGTGCATATCGGGGAT 
GAAAGCTGGCGCATGATGACCACCGATATGGCCAGTGTGCCGGTCTCCGTTATCGGGGAA 
GAAGTGGCTGATCTCAGCCACCGCGAAAATGACATCAAAAACGCCATTAACCTGATGTTC 
TGGGGAATATAAATGTCAGGCTCCCTTATACACAGCCAGTCTGCAGGTCGACCATAGTGA 
CTGGATATGTTGTGTTTTACAGTATTATGTAGTCTGTTTTTTATGCAAAATCTAATTTAA
TATATTGATATTTATATCATTTTACGTTTCTCGTTCAGCTTTCTTGTACAAAGTGGTTTG
ATGGCCGCTAAGTAAGTAAGACGTCGAGCTCTAAGTAAGTAACGGCCGCCACCGCGGTGG 
AGCTTTGGACTTCTTCGCCAGAGGTTTGGTCAAGTCTCCAATCAAGGTTGTCGGCTTGTC 
TACCTTGCCAGAAATTTACGAAAAGATGGAAAAGGGTCAAATCGTTGGTAGATACGTTGT
TGACACTTCTAAATAAGCGAATTTCTTATGATTTATGATTTTTATTATTAAATAAGTTAT
AAAAAAAATAAGTGTATACAAATTTTAAAGTGACTCTTAGGTTTTAAAACGAAAATTCTT
GTTCTTGAGTAACTCTTTCCTGTAGGTCAGGTTGCTTTCTCAGGTATAGCATGAGGTCGC
TCTTATTGACCACACCTCTACCGGCATGCCGAGCAAATGCCTGCAAATCGCTCCCCATTT 
CACCCAATTGTAGATATGCTAACTCCAGCAATGAGTTGATGAATCTCGGTGTGTATTTTA
TGTCCTCAGAGGACAATACCTGTTGTAATCGTTCTTCCACACGGATCCCAATTCGCCCTA 
TAGTGAGTCGTATTACAATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCC 
TGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAG 
CGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATGGAC 
GCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCT 
ACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACG 
TTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGT 
GCTTTACGGCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCA
TCGCCCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGA 
CTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGGTCTATTCTTTTGATTTATAA
GGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAAAATTTAAC
GCGAATTTTAACAAAATATTAACGTTTACAATTTCCTGATGCGGTATTTTCTCCTTACGC
ATCTGTGCGGTATTTCACACCGCATATCGACCGGTCGAGGAGAACTTCTAGTATATCCAC 
ATACCTAATATTATTGCCTTATTAAAAATGGAATCGGAACAATTACATCAAAATCCACAT
TCTCTTCAAAATCAATTGTCCTGTACTTCCTTGTTCATGTGTGTTCAAAAACGTTATATT
TATAGGATAATTATACTCTATTTCTCAACAAGTAATTGGTTGTTTGGCCGAGCGGTCTAA
GGCGCCTGATTCAAGAAATATCTTGACCGCAGTTAACTGTGGGAATACTCAGGTATCGTA 
AGATGCAAGAGTTCGAATCTCTTAGCAACCATTATTTTTTTCCTCAACATAACGAGAACA
CACAGGGGCGCTATCGCACAGAATCAAATTCGATGACTGGAAATTTTTTGTTAATTTCAG
AGGTCGCCTGACGCATATACCTTTTTCAACTGAAAAATTGGGAGAAAAAGGAAAGGTGAG 
AGGCCGGAACCGGCTTTTCATATAGAATAGAGAAGCGTTCATGACTAAATGCTTGCATCA
CAATACTTGAAGTTGACAATATTATTTAAGGACCTATTGTTTTTTCCAATAGGTGGTTAG
CAATCGTCTTACTTTCTAACTTTTCTTACCTTTTACATTTCAGCAATATATATATATATT
TCAAGGATATACCATTCTAATGTCTGCCCCTATGTCTGCCCCTAAGAAGATCGTCGTTTT 
GCCAGGTGACCACGTTGGTCAAGAAATCACAGCCGAAGCCATTAAGGTTCTTAAAGCTAT 
TTCTGATGTTCGTTCCAATGTCAAGTTCGATTTCGAAAATCATTTAATTGGTGGTGCTGC
TATCGATGCTACAGGTGTCCCACTTCCAGATGAGGCGCTGGAAGCCTCCAAGAAGGTTGA
TGCCGTTTTGTTAGGTGCTGTGGGTGGTCCTAAATGGGGTACCGGTAGTGTTAGACCTGA 
ACAAGGTTTACTAAAAATCCGTAAAGAACTTCAATTGTACGCCAACTTAAGACCATGTAA 
CTTTGCATCCGACTCTCTTTTAGACTTATCTCCAATCAAGCCACAATTTGCTAAAGGTAC
TGACTTCGTTGTTGTCAGAGAATTAGTGGGAGGTATTTACTTTGGTAAGAGAAAGGAAGA
CGATGGTGATGGTGTCGCTTGGGATAGTGAACAATACACCGTTCCAGAAGTGCAAAGAAT
CACAAGAATGGCCGCTTTCATGGCCCTACAACATGAGCCACCATTGCCTATTTGGTCCTT
GGATAAAGCTAATGTTTTGGCCTCTTCAAGATTATGGAGAAAAACTGTGGAGGAAACCAT
CAAGAACGAATTCCCTACATTGAAGGTTCAACATCAATTGATTGATTCTGCCGCCATGAT
CCTAGTTAAGAACCCAACCCACCTAAATGGTATTATAATCACCAGCAACATGTTTGGTGA 
TATCATCTCCGATGAAGCCTCCGTTATCCCAGGTTCCTTGGGTTTGTTGCCATCTGCGTC
CTTGGCCTCTTTGCCAGACAAGAACACCGCATTTGGTTTGTACGAACCATGCCACGGTTC
TGCTCCAGATTTGCCAAAGAATAAGGTTGACCCTATCGCCACTATCTTGTCTGCTGCAAT 
GATGTTGAAATTGTCATTGAACTTGCCTGAAGAAGGTAAGGCCATTGAAGATGCAGTTAA 
AAAGGTTTTGGATGCAGGTATCAGAACTGGTGATTTAGGTGGTTCCAACAGTACCACCGA
AGTCGGTGATGCTGTCGCCGAAGAAGTTAAGAAAATCCTTGCTTAAAAAGATTCTCTTTT
TTTATGATATTTGTACATAAACTTTATAAATGAAATTCATAATAGAAACGACACGAAATT
ACAAAATGGAATATGTTCATAGGGTAGACGAAACTATATACGCAATCTACATACATTTAT
CAAGAAGGAGAAAAAGGAGGATAGTAAAGGAATACAGGTAAGCAAATTGATACTAATGGC 
TCMCGTGATMGGAAAMGMTTGCACTTTAACATTAATATTGACAAGGAGGAGGGCAC 
CACACAAAAAGTTAGGTGTAACAGAAAATCATGAAACTACGATTCCTAATTTGATATTGG
AGGATTTTCTCTAAAAAAAAAAAAATACAACAAATAAAAAACACTCAATGACCTGACCAT 
TTGATGGAGTTTAAGTCAATACCTTCTTGAACCATTTCCCATAATGGTGAAAGTTCCCTC 
AAGAATTTTACTCTGTCAGAAACGGCCTTACGACGTAGTCGATATGGTGCACTCTCAGTA 
CAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGACACCCGCCAACACCCGCTGACG 
CGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCTCCG 
GGAGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCGAAACGCGCGA 
GCCTTACGCATCTGTGCGGTATTTCACACCGCAGGCAAGTGCACAAACAATACTTAAATA
AATACTACTCAGTAATAACCTATTTCTTAGCATTTTTGACGAAATTTGCTATTTTGTTAG
AGTCTTTTACACCATTTGTCTCCACACCTCCGCTTACATCAACACCAATAACGCCATTTA 
ATCTAAGCGCATCACCAACATTTTCTGGCGTCAGTCCACCAGCTAACATAAAATGTAAGC 
TTTCGGGGCTCTCTTGCCTTCCAACCCAGTCAGAAATCGAGTTCCAATCCAAAAGTTCAC 
CTGTCCCACCTGCTTCTGAATCAAACAAGGGAATAAACGAATGAGGTTTCTGTGAAGCTG
CACTGAGTAGTATGTTGCAGTCTTTTGGAAATACGAGTCTTTTAATAACTGGCAAACCGA
GGAACTCTTGGTATTCTTGCCACGACTCATCTCCATGCAGTTGGACGATATCAATGCCGT 
AATCATTGACCAGAGCCAAAACATCCTCCTTAGGTTGATTACGAAACACGCCAACCAAGT
ATTTCGGAGTGCCTGAACTATTTTTATATGCTTTTACAAGACTTGAAATTTTCCTTGCAA
TAACCGGGTCAATTGTTCTCTTTCTATTGGGCACACATATAATACCCAGCAAGTCAGCAT 
CGGAATCTAGAGCACATTCTGCGGCCTCTGTGCTCTGCAAGCCGCAAACTTTCACCAATG 
GACCAGAACTACCTGTGAAATTAATAACAGACATACTCCAAGCTGCCTTTGTGTGCTTAA 
TCACGTATACTCACGTGCTCAATAGTCACCAATGCCCTCCCTCTTGGCCCTCTCCTTTTC
TTTTTTCGACCGAATTAATTCTTAATCGGCAAAAAAAGAAAAGCTCCGGATCAAGATTGT
ACGTAAGGTGACAAGCTATTTTTCAATAAAGAATATCTTCCACTACTGCCATCTGGCGTC
ATAACTGCAAAGTACACATATATTACGATGCTGTCTATTAAATGCTTCCTATATTATATA 
TATAGTAATGTCGTTTATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAA 
GCCAGCCCCGACACCCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGG 
CATCCGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCAC 
CGTCATCACCGAAACGCGCGAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTA 
ATGTCATGATAATAATGGTTTCTTAGGACGGATCGCTTGCCTGTAACTTACACGCGCCTC 
GTATCTTTTAATGATGGAATAATTTGGGAATTTACTCTGTGTTTATTTATTTTTATGTTT
TGTATTTGGATTTTAGAAAGTAAATAAAGAAGGTAGAAGAGTTACGGAATGAAGAAAAAA 
AAATAAACAAAGGTTTAAAAAATTTCAACAAAAAGCGTACTTTACATATATATTTATTAG
ACAAGAAAAGCAGATTAAATAGATATACATTCGATTAACGATAAGTAAAATGTAAAATCA 
CAGGATTTTCGTGTGTGGTCTTCTACACAGACAAGATGAAACAATTCGGCATTAATACCT 
GAGAGCAGGAAGAGCAAGATAAAAGGTAGTATTTGTTGGCGATCCCCCTAGAGTCTTTTA
CATCTTCGGAAAACAAAAACTATTTTTTCTTTAATTTCTTTTTTTACTTTCTATTTTTAA
TTTATATATTTATATTAAAAAATTTAAATTATAATTATTTTTATAGCACGTGATGAAAAG
GACCCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAA
ATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATAT 
TGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCG
GCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAA 
GATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTT
GAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGT
GGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTAT 
TCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATG
ACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTA
CTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTTCACAACATGGGGGAT
CATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAG 
CGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACTATTAACTGGCGAA 
CTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCA 
GGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCC 
GGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGT 
ATCGTAGTTATCTACACGACGGGCAGTCAGGCAACTATGGATGAACGAAATAGACAGATC 
GCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATAT 
ATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTT
TTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGAC
CCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGC
TTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCA
ACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTA
GTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCT 
CTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTG 
GACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGC 
ACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCAT 
TGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGG 
GTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGGAACGCCTGGTATCTTTATAGT 
CCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGG 
CCGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGG
CCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACC 
GCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTG 
AGCGAGGAAGCGGAAGAGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATT 
CATTAATGCAGCTGGCACGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCA 
ATTAATGTGAGTTACCTCACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCT 
CCTATGTTGTGTGGAATTGTGAGCGGATAACAATTTCACACAGGAAACAGCTATGACCAT 
GATTACGCCAAGCTCGGAATTAACCCTCACTAAAGGGAACAAAAGCTGGGTACCGGGCCC 
CCCCTCGAGATCCGGGATCGAAGAAATGATGGTAAATGAAATAGGAAATCAAGGAGCATG 
AAGGCAAAAGACAAATATAAGGGTCGAACGAAAAATAAAGTGAAAAGTGTTGATATGATG 
TATTTGGCTTTGCGGCGCCGAAAAAACGAGTTTACGCAATTGCACAATCATGCTGACTCT 
GTGGCGGACCCGCGCTCTTGCCGGCCCGGCGATAACGCTGGGCGTGAGGCTGTGCCCGGC
GGAGTTTTTTGCGCCTGCATTTTCCAAGGTTTACCCTGCGCTAAGGGGCGAGATTGGAGA
AGCAATAAGAATGCCGGTTGGGGTTGCGATGATGACGACCACGACAACTGGTGTCATTAT 
TTAAGTTGCCGAAAGAACCTGAGTGCATTTGCAACATGAGTATACTAGAAGAATGAGCCA 
AGACTTGCGAGACGCGAGTTTGCCGGTGGTGCGAACAATAGAGCGACCATGACCTTGAAG 
GTGAGACGCGCATAACCGCTAGAGTACTTTGAAGAGGAAACAGCAATAGGGTTGCTACCA 
GTATAAATAGACAGGTACATACAACACTGGAAATGGTTGTCTGTTTGAGTACGCTTTCAA 
TTCATTTGGGTGTGCACTTTATTATGTTACAATATGGAAGGGAACTTTACACTTCTCCTA 
TGCACATATATTAATTAAAGTCCAATGCTAGTAGAGAAGGGGGGTAACACCCCTCCGCGC
TCTTTTCCGATTTTTTTCTAAACCGTGGAATATTTCGGATATCCTTTTGTTGTTTCCGGG
TGTACAATATGGACTTCCTCTTTTCTGGCAACCAAACCCATACATCGGGATTCCTATAAT 
ACCTTCGTTGGTCTCCCTAACATGTAGGTGGCGGAGGGGAGATATACAATAGAACAGATA 
CCAGACAAGACATAATGGGCTAAACAAGACTACACCAATTACACTGCCTCATTGATGGTG 
GTACATAACGAACTAATACTGTAGCCCTAGACTTGATAGCCATCATCATATCGAAGTTTC
ACTACCCTTTTTCCATTTGCCATCTATTGAAGTAATAATAGGCGCATGCAACTTCTTTTC
TTTTTTTTTCTTTTCTCTCTCCCCCGTTGTTGTCTCACCATATCCGCAATGACAAAAAAA
ATGATGGAAGACACTAAAGGAAAAAATTAACGACAAAGACAGCACCAACAGATGTCGTTG 
TTCCAGAGCTGATGAGGGGTATCTTCGAACACACGAAACTTTTTCCTTCCTTCATTCACG
CACACTACTCTCTAATGAGCAACGGTATACGGCCTTCCTTCCAGTTACTTGAATTTGAAA 
TAAAAAAAGTTTGCCGCTTTGCTATCAAGTATAAATAGACCTGCAATTATTAATCTTTTG
TTTCCTCGTCATTGTTCTCGTTCCCTTTCTTCCTTGTTTCTTTTTCTGCACAATATTTCA
AGCTATACCAAGCATACAATCAACTCCAAGCTTATGCCCAAGAAGAAGCGGAAGGTCTCG 
AGCGGCGCCAATTTTAATCAAAGTGGGAATATTGCTGATAGCTCATTGTCCTTCACTTTC
ACTAACAGTAGCAACGGTCCGAACCTCATAACAACTCAAACAAATTCTCAAGCGCTTTCA 
CAACCAATTGCCTCCTCTAACGTTCATGATAACTTCATGAATAATGAAATCACGGCTAGT 
AAAATTGATGATGGTAATAATTCAAAACCACTGTCACCTGGTTGGACGGACCAAACTGCG 
TATAACGCGTTTGGAATCACTACAGGGATGTTTAATACCACTACAATGGATGATGTATAT 
AACTATCTATTCGATGATGAAGATACCCCACCAAACCCAAAAAAAGAGGGTGGGTCGAAT 
CAAACAAGTTTGTACAAAAAAGCTGAACGAGAAACGTAAAATGATATAAATATCAATATA 
TTAAATTAGATTTTGCATAAAAAACAGACTACATAATACTGTAAAACACAACATATCCAG
TCACTATGGCGGCCGCTAAGTTGGCAGCATCACCCGACGCACTTTGCGCCGAATAAATAC 
CTGTGACGGAAGATCACTTCGCAGAATAAATAAATCCTGGTGTCCCTGTTGATACCGGGA 
AGCCCTGGGCCAACTTTTGGCGAAAATGAGACGTTGATCGGCACGTAAGAGGTTCCAACT
TTCACCATAATGAAATAAGATCACTACCGGGCGTATTTTTTGAGTTATCGAGATTTTCAG
GAGCTAAGGAAGCTAAAATGGAGAAAAAAATCACTGGATATACCACCGTTGATATATCCC 
AATGGCATCGTAAAGAACATTTTGAGGCATTTCAGTCAGTTGCTCAATGTACCTATAACC
AGACCGTTCAGCTGGATATTACGGCCTTTTTAAAGACCGTAAAGAAAAATAAGCACAAGT 
TTTATCCGGCCTTTATTCACATTCTTGCCCGCCTGATGAATGCTCATCCGGAATTCCGTA 
TGGCAATGAAAGACGGTGAGCTGGTGATATGGGATAGTGTTCACCCTTGTTACACCGTTT 
TCCATGAGCAAACTGAAACGTTTTCATCGCTCTGGAGTGAATACCACGACGATTTCCGGC
AGTTTCTACACATATATTCGCAAGATGTGGCGTGTTACGGTGAAAACCTGGCCTATTTCC 
CTAAAGGGTTTATTGAGAATATGTTTTTCGTCTCAGCCAATCCCTGGGTGAGTTTCACCA
GTTTTGATTTAAACGTGGCCAATATGGACAACTTCTTCGCCCCCGTTTTCACCATGGGCA
AATATTATACGCAAGGCGACAAGGTGCTGATGCCGCTGGCGATTCAGGTTCATCATGCCG 
TCTGTGATGGCTTCCATGTCGGCAGAATGCTTAATGAATTACAACAGTACTGCGATGAGT 
GGCAGGGCGGGGCGTAATCTAGAGGATCCGGCTTACTAAAAGCCAGATAACAGTATGCGT
ATTTGCGCGCTGATTTTTGCGGTATAAGAATATATACTGATATGTATACCCGAAGTATGT
CAAAAAGAGGTGTGCTATGAAGCAGCGTATTACAGTGACAGTTGACAGCGACAGCTATCA 
GTTGCTCAAGGCATATATGATGTCAATATCTCCGGTCTGGTAAGCACAACCATGCAGAAT 
GAAGCCCGTCGTCTGCGTGCCGAACGCTGGAAAGCGGAAAATCAGGAAGGGATGGCTGAG
GTCGCCCGGTTTATTGAAATGAACGGCTCTTTTGCTGACGAGAACAGGGACTGGTGAAAT
GCAGTTTAAGGTTTACACCTATAAAAGAGAGAGCCGTTATCGTCTGTTTGTGGATGTACA
GAGTGATATTATTGACACGCCCGGGCGACGGATGGTGATCCCCCTGGCCAGTGCACGTCT 
GCTGTCAGATAAAGTCTCCCGTGAACTTTACCCGGTGGTGCATATCGGGGATGAAAGCTG 
GCGCATGATGACCACCGATATGGCCAGTGTGCCGGTCTCCGTTATCGGGGAAGAAGTGGC 
TGATCTCAGCCACCGCGAAAATGACATCAAAAACGCCATTAACCTGATGTTCTGGGGAAT 
ATAAATGTCAGGCTCCGTTATACACAGCCAGTCTGCAGGTCGACCATAGTGACTGGATAT 
GTTGTGTTTTACAGTATTATGTAGTCTGTTTTTTATGCAAAATCTAATTTAATATATTGA
TATTTATATCATTTTACGTTTCTCGTTCAGCTTTCTTGTACAAAGTGGTTTGATGGCCGC
TAAGTAAGTAAGACGTCGAGCTCCCTATAGTGAGTCGTATTACACTGGCCGTCGTTTTAC 
AACGTCGTGACTGGGAAAACACCGGTGAGCTCTAAGTAAGTAACGGCCGCCACCGCGGTG 
GAGCTTTGGACTTCTTCGCCAGAGGTTTGGTCAAGTCTCCAATCAAGGTTGTCGGCTTGT 
CTACCTTGCCAGAAATTTACGAAAAGATGGAAAAGGGTCAAATCGTTGGTAGATACGTTG
TTGACACTTCTAAATAAGCGAATTTCTTATGATTTATGATTTTTATTATTAAATAAGTTA
TAAAAAAAATAAGTGTATACAAATTTTAAAGTGACTCTTAGGTTTTAAAACGAAAATTCT
TGTTCTTGAGTAACTCTTTCCTGTAGGTCAGGTTGCTTTCTCAGGTATAGCATGAGGTCG
CTCTTATTGACCACACCTCTACCGGCATGCCGAGCAAATGCCTGCAAATCGCTCCCCATT 
TCACCCAATTGTAGATATGCTAACTCCAGCAATGAGTTGATGAATCTCGGTGTGTATTTT
ATGTCCTCAGAGGACAATACCTGTTGTAATCGTTCTTCCACACGGATCCGCATCAGGCGA 
AATTGTAAACGTTAATATTTTGTTAAAATTCGCGTTAAATATTTGTTAAATCAGCTCATT
TTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTATAAATCAAAAGAATAGACCGAGAT 
AGGGTTGAGTGTTGTTCCAGTTTGGAACAAGAGTCCACTATTAAAGAACGTGGACTCCAA 
CGTCAAAGGGCGAAAAACCGTCTATCAGGGCGATGGCCCACTACGTGAACCATCACCCTA 
ATCAAGTTTTTTGGGGTCGAGGTGCCGTAAAGCACTAAATCGGAACCCTAAAGGGAGCCC
CCGATTTAGAGCTTGACGGGGAAAGCCGGCGAACGTGGCGAGAAAGGAAGGGAAGAAAGC 
GAAAGGAGCGGGCGCTAGGGCGCTGGCAAGTGTAGCGGTCACGCTGCGCGTAACCACCAC 
ACCCGCCGCGCTTAATGCGCCGCTACAGGGCGCGTCCCATTCGCCATTCACTGCA

pDEST34 7114 bp

Location (Base Nos.)    Gene Encoded
195..71                 attR1
304..963                CmR
1305..1610              ccdB
1651..1775              attR2
1780..2472              GST
2675..2720              T7stop
3334..4194              ampR
4343..4982              ori


atcgagatctcgatcccgcgaaattaatacgactcactatagggagaccacaacggtttc
cctctagatcacaagtttgtacaaaaaagctgaacgagaaacgtaaaatgatataaatat
caatatattaaattagattttgcataaaaaacagactacataatactgtaaaacacaaca
tatccagtcactatggcggccgcattaggcaccccaggctttacactttatgcttccggc
tcgtataatgtgtggattttgagttaggatccggcgagattttcaggagctaaggaagct
aaaatggagaaaaaaatcactggatataccaccgttgatatatcccaatggcatcgtaaa
gaacattttgaggcatttcagtcagttgctcaatgtacctataaccagaccgttcagctg
gatattacggcctttttaaagaccgtaaagaaaaataagcacaagttttatccggccttt
attcacattcttgcccgcctgatgaatgctcatccggaattccgtatggcaatgaaagac
ggtgagctggtgatatgggatagtgttcacccttgttacaccgttttccatgagcaaact
gaaacgttttcatcgctctggagtgaataccacgacgatttccggcagtttctacacata
tattcgcaagatgtggcgtgttacggtgaaaacctggcctatttccctaaagggtttatt
gagaatatgtttttcgtctcagccaatccctgggtgagtttcaccagttttgatttaaac
gtggccaatatggacaacttcttcgcccccgttttcaccatgggcaaatattatacgcaa
ggcgacaaggtgctgatgccgctggcgattcaggttcatcatgccgtctgtgatggcttc
catgtcggcagaatgcttaatgaattacaacagtactgcgatgagtggcagggcggggcg
taaacgcgtggatccggcttactaaaagccagataacagtatgcgtatttgcgcgctgat
ttttgcggtataagaatatatactgatatgtatacccgaagtatgtcaaaaagaggtgtg
ctatgaagcagcgtattacagtgacagttgacagcgacagctatcagttgctcaaggcat
atatgatgtcaatatctccggtctggtaagcacaaccatgcagaatgaagcccgtcgtct
gcgtgccgaacgctggaaagcggaaaatcaggaagggatggctgaggtcgcccggtttat
tgaaatgaacggctcttttgctgacgagaacagggactggtgaaatgcagtttaaggttt
acacctataaaagagagagccgttatcgtctgtttgtggatgtacagagtgatattattg
acacgcccgggcgacggatggtgatccccctggccagtgcacgtctgctgtcagataaag
tctcccgtgaactttacccggtggtgcatatcggggatgaaagctggcgcatgatgacca
ccgatatggccagtgtgccggtctccgttatcggggaagaagtggctgatctcagccacc
gcgaaaatgacatcaaaaacgccattaacctgatgttctggggaatataaatgtcaggct
cccttatacacagccagtctgcaggtcgaccatagtgactggatatgttgtgttttacag
tattatgtagtctgttttttatgcaaaatctaatttaatatattgatatttatatcattt
tacgtttctcgttcagctttcttgtacaaagtggtgattatgtcccctatactaggttat
tggaaaattaagggccttgtgcaacccactcgacttcttttggaatatcttgaagaaaaa
tatgaagagcatttgtatgagcgcgatgaaggtgataaatggcgaaacaaaaagtttgaa



FIG.96B 






TTGGGTTTGGAGTTTCCCAATCTTCCTTATTATATTGATGGTGATGTTAAATTAACACAG

TCTATGGCCATCATACGTTATATAGCTGACAAGCACAACATGTTGGGTGGTTGTCCAAAA 

GAGCGTGCAGAGATTTCAATGCTTGAAGGAGCGGTTTTGGATATTAGATACGGTGTTTCG

AGAATTGCATATAGTAAAGACTTTGAAACTCTCAAAGTTGATTTTCTTAGCAAGCTACCT

GAAATGCTGAAAATGTTCGAAGATCGTTTATGTCATAAAACATATTTAAATGGTGATCAT

GTAACCCATCCTGACTTCATGTTGTATGACGCTCTTGATGTTGTTTTATACATGGACCCA

ATGTGCCTGGATGCGTTCCCAAAATTAGTTTGTTTTAAAAAACGTATTGAAGCTATCCCA

CAAATTGATAAGTACTTGAAATCCAGCAAGTATATAGCATGGCCTTTGCAGGGCTGGCAA 

GCCACGTTTGGTGGTGGCGACCATCCTCCAAAATCGGATCTGGTTCCGCGTCCATGGGGA 

TCCGGCTGCTAACAAAGCCCGAAAGGAAGCTGAGTTGGCTGCTGCCACCGCTGAGCGCTT

CCCGATAAGGGAGCAGGCCAGTAAAAGCATTACCCGTGGTGGGGTTCCCGAGCGGCCAAA 

GGGAGCAGACTCTAAATCTGCCGTCATCGACTTCGAAGGTTCGAATCCTTCCCCCACCAC 

CATCACTTTCAAAAGTGAATTCGCTGAGCAATAACTAGCATAACCCCTTGGGGCCTCTAA

ACGGGTCTTGAGGGGTTTTTTGCTGAAAGGAGGAACTATATCCGGATATCCACAGGACGG 

GTGTGGTCGCCATGATCGCGTAGTCGATAGTGGCTCCAAGTAGCGAAGCGAGCAGGACTG 

GGCGGCGGCCAAAGCGGTCGGACAGTGCTCCGAGAACGGGTGCGCATAGAAATTGCATCA 

ACGCATATAGCGCTAGCAGCACGCCATAGTGACTGGCGATGCTGTCGGAATGGACGATAT 

CCCGCAAGAGGCCCGGCAGTACCGGCATAACCAAGCCTATGCCTACAGCATCCAGGGTGA 

CGGTGCCGAGGATGACGATGAGCGCATTGTTAGATTTCATACACGGTGCCTGACTGCGTT 

AGCAATTTAACTGTGATAAACTACCGCATTAAAGCTTATCGATGATAAGCTGTCAAACAT

GAGAATTCTTGAAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATG

ATAATAATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCT

ATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGA 

TAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCC

CTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTG

AAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTC

AACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACT

TTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTGTTGACGCCGGGCAAGAGCAACTC 

GGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAG 

CATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGAT 

AACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTT

TTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAA 

GCCATACCAAACGACGAGCGTGACACCACGATGCCTGCAGCAATGGCAACAACGTTGCGC 

AAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATG 

GAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATT 

GCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCA 

GATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGAT 

GAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCA 

GACCAAGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGG

ATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCG

TTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTT

CTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTG

CCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATA 

CCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCA 

CCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAG 

TCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGC
TGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGA

TACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGG 

TATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAAC 

GCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTG 

TGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGG 

TTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCT 

GTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACC 

GAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCTGATGCGGTATTTTCTCCTT 

ACGCATCTGTGCGGTATTTCACACCGCATATATGGTGCACTCTCAGTACAATCTGCTCTG 

ATGCCGCATAGTTAAGCCAGTATACACTCCGCTATCGCTACGTGACTGGGTCATGGCTGC 

GCCCCGACACCCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATC 

CGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTC 

ATCACCGAAACGCGCGAGGCAGCTGCGGTAAAGCTCATCAGCGTGGTCGTGAAGCGATTC 

ACAGATGTCTGCCTGTTCATCCGCGTCCAGCTCGTTGAGTTTCTCCAGAAGCGTTAATGT 

CTGGCTTCTGATAAAGCGGGCCATGTTAAGGGCGGTTTTTTCCTGTTTGGTCACTGATGC

CTCCGTGTAAGGGGGATTTCTGTTCATGGGGGTAATGATACCGATGAAACGAGAGAGGAT 

GCTCACGATACGGGTTACTGATGATGAACATGCCCGGTTACTGGAACGTTGTGAGGGTAA 

ACAACTGGCGGTATGGATGCGGCGGGACCAGAGAAAAATCACTCAGGGTCAATGCCAGCG 

CTTCGTTAATACAGATGTAGGTGTTCCACAGGGTAGCCAGCAGCATCCTGCGATGCAGAT 

CCGGAACATAATGGTGCAGGGCGCTGACTTCCGCGTTTCCAGACTTTACGAAACACGGAA

ACCGAAGACCATTCATGTTGTTGCTCAGGTCGCAGACGTTTTGCAGCAGCAGTCGCTTCA

CGTTCGCTCGCGTATCGGTGATTCATTCTGCTAACCAGTAAGGCAACCCCGCCAGCCTAG 

CCGGGTCCTCAACGACAGGAGCACGATCATGCGCACCCGTGGCCAGGACCCAACGCTGCC 

CGAGATGCGCCGCGTGCGGCTGCTGGAGATGGCGGACGCGATGGATATGTTCTGCCAAGG 

GTTGGTTTGCGCATTCACAGTTCTCCGCAAGAATTGATTGGCTCCAATTCTTGGAGTGGT 

GAATCCGTTAGCGAGGTGCCGCCGGCTTCCATTCAGGTCGAGGTGGCCCGGCTCCATGCA 

CCGCGACGCAACGCGGGGAGGCAGACAAGGTATAGGGCGGCGCCTACAATCCATGCCAAC 

CCGTTCCATGTGCTCGCCGAGGCGGCATAAATCGCCGTGACGATCAGCGGTCCAGTGATC 

GAAGTTAGGCTGGTAAGAGCCGCGAGCGATCCTTGAAGCTGTCCCTGATGGTCGTCATCT 

ACCTGCCTGGACAGCATGGCCTGCAACGCGGGCATCCCGATGCCGCCGGAAGCGAGAAGA 

ATCATAATGGGGAAGGCCATCCAGCCTCGCGTCGCGAACGCCAGCAAGACGTAGCCCAGC 

GCGTCGGCCGCCATGCCGGCGATAATGGCCTGCTTCTCGCCGAAACGTTTGGTGGCGGGA 

CCAGTGACGAAGGCTTGAGCGAGGGCGTGCAAGATTCCGAATACCGCAAGCGACAGGCCG 

ATCATCGTCGCGCTCCAGCGAAAGCGGTCCTCGCCGAAAATGACCCAGAGCGCTGCCGGC 

ACCTGTCCTACGAGTTGCATGATAAAGAAGACAGTCATAAGTGCGGCGACGATAGTCATG 

CCCCGCGCCCACCGGAAGGAGCTGACTGGGTTGAAGGCTCTCAAGGGCATCGGTCGATCG 

ACGCTCTCCCTTATGCGACTCCTGCATTAGGAAGCAGCCCAGTAGTAGGTTGAGGCCGTT 

GAGCACCGCCGCCGCAAGGAATGGTGCATGCAAGGAGATGGCGCCCAACAGTCCCCCGGC 

CACGGGGCCTGCCACCATACCCACGCCGAAACAAGCGCTCATGAGCCCGAAGTGGCGAGC 

CCGATCTTCCCCATCGGTGATGTCGGCGATATAGGCGCCAGCAACCGCACCTGTGGCGCC 

GGTGATGCCGGCCACGATGCGTCCGGCGTAGAGG
GCGAGAGTAGGGAACTGCCAGGCATCAAATAAAACGAAAGGCTCAGTCGGAAGACTGGGC
CTTTCGTTTTATCTGTTGTTTGTCGGTGAACGCTCTCCTGAGTAGGACAAATCCGCCGGG 
AGCGGATTTGAACGTTGTGAAGCAACGGCCCGGAGGGTGGCGGGCAGGACGCCCGCCATA 
AACTGCCAGGCATCAAACTAAGCAGAAGGCCATCCTGACGGATGGCCTTTTTGCGTTTCT
ACAAACTCTTCCTGGCTAGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGA 
AAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTG
GCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAG 
AGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTC 
GTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCG 
GGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTT 
CGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCC 
GGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCC 
ACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGG 
TGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCA 
GTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGC
GGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGAT
CCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATT 
TTGGTCATGAGCTTGCGCCGTCCCGTCAAGTCAGCGTAATGCTCTGCCAGTGTTACAACC 
AATTAACCAATTCTGATTAGAAAAACTCATCGAGCATCAAATGAAACTGCAATTTATTCA
TATCAGGATTATCAATACCATATTTTTGAAAAAGCCGTTTCTGTAATGAAGGAGAAAACT
CACCGAGGCAGTTCCATAGGATGGCAAGATCCTGGTATCGGTCTGCGATTCCGACTCGTC 
CAACATCAATACAACCTATTAGTAGCCAACCACTAGAACTATAGCTAGAGTCCTGGGCGA 
ACAAACGATGCTCGCCTTCCAGAAAACCGAGGATGCGAACCACTTCATCCGGGGTCAGCA 
CCACCGGCAAGCGCCGCGACGGCCGAGGTCTTCCGATCTCCTGAAGCCAGGGCAGATCCG 
TGCACAGCACCTTGCCGTAGAAGAACAGCAAGGCCGCCAATGCCTGACGATGCGTGGAGA 
CCGAAACCTTGCGCTCGTTCGCCAGCCAGGACAGAAATGCCTCGACTTCGCTGCTGCCCA 
AGGTTGCCGGGTGACGCACACCGTGGAAACGGATGAAGGCACGAACCCAGTTGACATAAG 

cctgttcggttcgtaaactgtaatgcaagtagcgtatgcgctcacgcaactggtccagaa 
ccttgaccgaacgcagcggtggtaacggcgcagtggcggttttcatggcttgttatgact
gtttttttgtacagtctatgcctcgggcatccaagcagcaagcgcgttacgccgtgggtc
gatgtttgatgttatggagcagcaacgatgttacgcagcagcaacgatgttacgcagcag
ggcagtcgccctaaaacaaagttaggtggctcaagtatgggcatcattcgcacatgtagg 
ctcggccctgaccaagtcaaatccatgcgggctgctcttgatcttttcggtcgtgagttc
ggagacgtagccacctactcccaacatcagccggactccgattacctcgggaacttgctc 
cgtagtaagacattcatcgcgcttgctgccttcgaccaagaagcggttgttggcgctctc 
gcggcttacgttctgcccaggtttgagcagccgcgtagtgagatctatatctatgatctc 
gcagtctccggcgagcaccggaggcagggcattgccaccgcgctcatcaatctcctcaag 
catgaggccaacgcgcttggtgcttatgtgatctacgtgcaagcagattacggtgacgat 
cccgcagtggctctctatacaaagttgggcatacgggaagaagtgatgcactttgatatc 
gacccaagtaccgccacctaacaattcgttcaagccgagatcggcttcccggcctaattt 
cccctcgtcaaaaataaggttatcaagtgagaaatcaccatgagtgacgactgaatccgg 
tgagaatggcaaaagtttatgcatttctttccagacttgttcaacaggccagccattacg

CTCGTCATCAAAATCACTCGCATCAACCAAACCGTTATTCATTCGTGATTGCGCCTGAGC 
GAGACGAAATACGCGATCGCTGTTAAAAGGACAATTACAAACAGGAATCGAATGCAACCG 
GCGCAGGAACACTGCCAGCGCATCAACAATATTTTCACCTGAATCAGGATATTCTTCTAA
TACCTGGAATGCTGTTTTTCCGGGGATCGCAGTGGTGAGTAACCATGCATCATCAGGAGT 
ACGGATAAAATGCTTGATGGTCGGAAGAGGCATAAATTCCGTCAGCCAGTTTAGTCTGAC 
CATCTCATCTGTAACATCATTGGCAACGCTACCTTTGCCATGTTTCAGAAACAACTCTGG
CGCATCGGGCTTCCCATACAAGCGATAGATTGTCGCACCTGATTGCCCGACATTATCGCG

AGCCCATTTATACCCATATAAATCAGCATCCATGTTGGAATTTAATCGCGGCCTCGACGT 

TTCCCGTTGAATATGGCTCATAACACCCCCTGTATTACTGTTTATGTAAGCAGACAGTTT

TATTGTTCATGATGATATATTTTTATCTTGTGCAATGTAACATCAGAGATTTTGAGACAC 

GGGCCAGAGCTGCAGCTGGATGGCAAATAATGATTTTATTTTGACTGATAGTGACCTGTT

CGTTGCAACAAATTGATAAGCAATGCTTTCTTATAATGCCAACTTTGTACAAGAAAGCTG

AACGAGAAACGTAAAATGATATAAATATCAATATATTAAATTAGATTTTGCATAAAAAAC

AGACTACATAATACTGTAAAACACAACATATCCAGTCACTATGAATCAACTACTTAGATG 

GTATTAGTGACCTGTAGTCGACTAAGTTGGCAGCATCACCCGACGCACTTTGCGCCGAAT 

AAATACCTGTGACGGAAGATCACTTCGCAGAATAAATAAATCCTGGTGTCCCTGTTGATA 

CCGGGAAGCCCTGGGCCAACTTTGGCGAAAATGAGACGTTGATCGGCACGTAAGAGGTTC 

CAACTTTCACCATAATGAAATAAGATCACTACCGGGCGTATTTTTTGAGTTATCGAGATT

TTCAGGAGCTAAGGAAGCTAAAATGGAGAAAAAAATCACTGGATATACCACCGTTGATAT 

ATCCCAATGGCATCGTAAAGAACATTTTGAGGCATTTCAGTCAGTTGCTCAATGTACCTA

TAACCAGACCGTTCAGCTGGATATTACGGCCTTTTTAAAGACCGTAAAGAAAAATAAGCA

CAAGTTTTATCCGGCCTTTATTCACATTCTTGCCCGCCTGATGAATGCTCATCCGGAATT

CCGTATGGCAATGAAAGACGGTGAGCTGGTGATATGGGATAGTGTTCACCCTTGTTACAC 

CGTTTTCCATGAGCAAACTGAAACGTTTTCATCGCTCTGGAGTGAATACCACGACGATTT

CCGGCAGTTTCTACACATATATTCGCAAGATGTGGCGTGTTACGGTGAAAACCTGGCCTA

TTTCCCTAAAGGGTTTATTGAGAATATGTTTTTCGTCTCAGCCAATCCCTGGGTGAGTTT

CACCAGTTTTGATTTAAACGTGGCCAATATGGACAACTTCTTCGCCCCCGTTTTCACCAT

GGGCAAATATTATACGCAAGGCGACAAGGTGCTGATGCCGCTGGCGATTCAGGTTCATCA

TGCCGTCTGTGATGGCTTCCATGTCGGCAGAATGCTTAATGAATTACAACAGTACTGCGA 

TGAGTGGCAGGGCGGGGCGTAATCGCGTGGATCCGGCTTACTAAAAGCCAGATAACAGTA

TGCGTATTTGCGCGCTGATTTTTGCGGTATAAGAATATATACTGATATGTATACCCGAAG 

TATGTCAAAAAGAGGTGTGCTATGAAGCAGCGTATTACAGTGACAGTTGACAGCGACAGC 

TATCAGTTGCTCAAGGCATATATGATGTCAATATCTCCGGTCTGGTAAGCACAACCATGC 

AGAATGAAGCCCGTCGTCTGCGTGCCGAACGCTGGAAAGCGGAAAATCAGGAAGGGATGG 

CTGAGGTCGCCCGGTTTATTGAAATGAACGGCTCTTTTGCTGACGAGAACAGGGACTGGT 

GAAATGCAGTTTAAGGTTTACACCTATAAAAGAGAGAGCCGTTATCGTCTGTTTGTGGAT

GTACAGAGTGATATTATTGACACGCCCGGGCGACGGATGGTGATCCCCCTGGCCAGTGCA 

CGTCTGCTGTCAGATAAAGTCTCCCGTGAACTTTACCCGGTGGTGCATATCGGGGATGAA 

AGCTGGCGCATGATGACCACCGATATGGCCAGTGTGCCGGTCTCCGTTATCGGGGAAGAA 

GTGGCTGATCTCAGCCACCGCGAAAATGACATCAAAAACGCCATTAACCTGATGTTCTGG 

GGAATATAAATGTCAGGCTCCCTTATACACAGCCAGTCTGCAGGTCGATACAGTAGAAAT 

TACAGAAACTTTATCACGTTTAGTAAGTATAGAGGCTGAAAATCCAGATGAAGCCGAACG 

ACTTGTAAGAGAAAAGTATAAGAGTTGTGAAATTGTTCTTGATGCAGATGATTTTCAGGA

CTATGACACTAGCGTATATGAATAGGTAGATGTTTTTATTTTGTCACACAAAAAAGAGGC

TCGCACCTCTTTTTCTTATTTCTTTTTATGATTTAATACGGCATTGAGGACAATAGCGAG 

TAGGCTGGATACGACGATTCCGTTTGAGAAGAACATTTGGAAGGCTGTCGGTCGACTAAG 

TTGGCAGCATCACCCGAAGAACATTTGGAAGGCTGTCGGTCGACTACAGGTCACTAATAC 

CATCTAAGTAGTTGATTCATAGTGACTGGATATGTTGTGTTTTACAGTATTATGTAGTCT

GTTTTTTATGCAAAATCTAATTTAATATATTGATATTTATATCATTTTACGTTTCTCGTT

CAGCTTTTTTGTACAAAGTTGGCATTATAAAAAAGCATTGCTCATCAATTTGTTGCAACG

AACAGGTCACTATCAGTCAAAATAAAATCATTATTTGGGGCCCGAGATCCATGCTAGCGT 
TAAC
gccttacgcatctgtgcggtatttcacaccgcaggcaagtgcacaaacaatacttaaata
aatactactcagtaataacctatttcttagcatttttgacgaaatttgctattttgttag
agtcttttacaccatttgtctccacacctccgcttacatcaacaccaataacgccattta 
atctaagcgcatcaccaacattttctggcgtcagtccaccagctaacataaaatgtaagc 
tttcggggctctcttgccttccaacccagtcagaaatcgagttccaatccaaaagttcac 
ctgtcccacctgcttctgaatcaaacaagggaataaacgaatgaggtttctgtgaagctg 
cactgagtagtatgttgcagtcttttggaaatacgagtcttttaataactggcaaaccga 
ggaactcttggtattcttgccacgactcatctccatgcagttggacgatatcaatgccgt 
aatcattgaccagagccaaaacatcctccttaggttgattacgaaacacgccaaccaagt 
atttcggagtgcctgaactatttttatatgcttttacaagacttgaaattttccttgcaa
taaccgggtcaattgttctctttctattgggcacacatataatacccagcaagtcagcat 
cggaatctagagcacattctgcggcctctgtgctctgcaagccgcaaactttcaccaatg 
gaccagaactacctgtgaaattaataacagacatactccaagctgcctttgtgtgcttaa 
tcacgtatactcacgtgctcaatagtcaccaatgccctccctcttggccctctccttttc 
ttttttcgaccgaattaattcttaatcggcaaaaaaagaaaagctccggatcaagattgt
acgtaaggtgacaagctatttttcaataaagaatatcttccactactgccatctggcgtc
ataactgcaaagtacacatatattacgatgctgtctattaaatgcttcctatattatata 
tatagtaatgtcgtttatggtgcactctcagtacaatctgctctgatgccgcatagttaa 
gccagccccgacacccgccaacacccgctgacgcgccctgacgggcttgtctgctcccgg 
catccgcttacagacaagctgtgaccgtctccgggagctgcatgtgtcagaggttttcac 
cgtcatcaccgaaacgcgcgagacgaaagggcctcgtgatacgcctatttttataggtta
atgtcatgataataatggtttcttaggacggatcgcttgcctgtaacttacacgcgcctc 
gtatcttttaatgatggaataatttgggaatttactctgtgtttatttatttttatgttt

TGTATTTGGATTTTAGAAAGTAAATAAAGAAGGTAGAAGAGTTACGGAATGAAGAAAAAA 
AAATAAACAAAGGTTTAAAAAATTTCAACAAAAAGCGTACTTTACATATATATTTATTAG
ACAAGAAAAGCAGATTAAATAGATATACATTCGATTAACGATAAGTAAAATGTAAAATCA 
CAGGATTTTCGTGTGTGGTCTTCTACACAGACAAGATGAAACAATTCGGCATTAATACCT 
GAGAGCAGGAAGAGCAAGATAAAAGGTAGTATTTGTTGGCGATCCCCCTAGAGTCTTTTA

CATCTTCGGAAAACAAAAACTATTTTTTCTTTAATTTCTTTTTTTACTTTCTATTTTTAA
TTTATATATTTATATTAAAAAATTTAAATTATAATTATTTTTATAGCACGTGATGAAAAG
GACCCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAA
ATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATAT 
TGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCG
GCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAA 
GATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTT 
GAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGT
GGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTAT 
TCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATG
ACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTA
CTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTTCACAACATGGGGGAT
CATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAG 
CGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACTATTAACTGGCGAA 
CTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCA 
GGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCC 
GGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGT 
ATCGTAGTTATCTACACGACGGGCAGTCAGGCAACTATGGATGAACGAAATAGACAGATC 
GCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATAT 
ATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTT
TTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGAC
CCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGC
TTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCA
ACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTA
GTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCT 
CTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTG 
GACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGC 
ACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCAT 
TGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGG 
GTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGGAACGCCTGGTATCTTTATAGT 
CCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGG 
CCGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGG
CCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACC 
GCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTG 
AGCGAGGAAGCGGAAGAGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATT 
CATTAATGCAGCTGGCACGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCA 
ATTAATGTGAGTTACCTCACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCT 
CCTATGTTGTGTGGAATTGTGAGCGGATAACAATTTCACACAGGAAACAGCTATGACCAT 
GATTACGCCAAGCTCGGAATTAACCCTCACTAAAGGGAACAAAAGCTGGGTACCGGGCCC 
CCCCTCGAGATCCGGGATCGAAGAAATGATGGTAAATGAAATAGGAAATCAAGGAGCATG 
AAGGCAAAAGACAAATATAAGGGTCGAACGAAAAATAAAGTGAAAAGTGTTGATATGATG 
TATTTGGCTTTGCGGCGCCGAAAAAACGAGTTTACGCAATTGCACAATCATGCTGACTCT 
GTGGCGGACCCGCGCTCTTGCCGGCCCGGCGATAACGCTGGGCGTGAGGCTGTGCCCGGC
GGAGTTTTTTGCGCCTGCATTTTCCAAGGTTTACCCTGCGCTAAGGGGCGAGATTGGAGA
AGCAATAAGAATGCCGGTTGGGGTTGCGATGATGACGACCACGACAACTGGTGTCATTAT 
TTAAGTTGCCGAAAGAACCTGAGTGCATTTGCAACATGAGTATACTAGAAGAATGAGCCA 
AGACTTGCGAGACGCGAGTTTGCCGGTGGTGCGAACAATAGAGCGACCATGACCTTGAAG 
GTGAGACGCGCATAACCGCTAGAGTACTTTGAAGAGGAAACAGCAATAGGGTTGCTACCA 
GTATAAATAGACAGGTACATACAACACTGGAAATGGTTGTCTGTTTGAGTACGCTTTCAA
TTCATTTGGGTGTGCACTTTATTATGTTACAATATGGAAGGGAACTTTACACTTCTCCTA 
TGCACATATATTAATTAAAGTCCAATGCTAGTAGAGAAGGGGGGTAACACCCCTCCGCGC
TCTTTTCCGATTTTTTTCTAAACCGTGGAATATTTCGGATATCCTTTTGTTGTTTCCGGG
TGTACAATATGGACTTCCTCTTTTCTGGCAACCAAACCCATACATCGGGATTCCTATAAT 
ACCTTCGTTGGTCTCCCTAACATGTAGGTGGCGGAGGGGAGATATACAATAGAACAGATA 
CCAGACAAGACATAATGGGCTAAACAAGACTACACCAATTACACTGCCTCATTGATGGTG 
GTACATAACGAACTAATACTGTAGCCCTAGACTTGATAGCCATCATCATATCGAAGTTTC 
ACTACCCTTTTTCCATTTGCCATCTATTGAAGTAATAATAGGCGCATGCAACTTCTTTTC
TTTTTTTTTCTTTTCTCTCTCCCCCGTTGTTGTCTCACCATATCCGCAATGACAAAAAAA
ATGATGGAAGACACTAAAGGAAAAAATTAACGACAAAGACAGCACCAACAGATGTCGTTG 
TTCCAGAGCTGATGAGGGGTATCTTCGAACACACGAAACTTTTTCCTTCCTTCATTCACG
CACACTACTCTCTAATGAGCAACGGTATACGGCCTTCCTTCCAGTTACTTGAATTTGAAA
TAAAAAAAGTTTGCCGCTTTGCTATCAAGTATAAATAGACCTGCAATTATTAATCTTTTG
TTTCCTCGTCATTGTTCTCGTTCCCTTTCTTCCTTGTTTCTTTTTCTGCACAATATTTCA
AGCTATACCAAGCATACAATCAACTCCAAGCTTATGCCCAAGAAGAAGCGGAAGGTCTCG 
AGCGGCGCCAATTTTAATCAAAGTGGGAATATTGCTGATAGCTCATTGTCCTTCACTTTC
ACTAACAGTAGCAACGGTCCGAACCTCATAACAACTCAAACAAATTCTCAAGCGCTTTCA 
CAACCAATTGCCTCCTCTAACGTTCATGATAACTTCATGAATAATGAAATCACGGCTAGT 
AAAATTGATGATGGTAATAATTCAAAACCACTGTCACCTGGTTGGACGGACCAAACTGCG 
TATAACGCGTTTGGAATCACTACAGGGATGTTTAATACCACTACAATGGATGATGTATAT 
AACTATCTATTCGATGATGAAGATACCCCACCAAACCCAAAAAAAGAGGGTGGGTCGATC 
ACAAGTTTGTACAAAAAAGCAGGCTTGTCGACCCCGGGAATTCAGATCTACTAGTGCGGC 
CGCACGCGTACCCAGCTTTCTTGTACAAAGTGGTGACGTCGAGCTCCCTATAGTGAGTCG 
TATTACACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACACCGGTGAGCTCTAAGT 
AAGTAACGGCCGCCACCGCGGTGGAGCTTTGGACTTCTTCGCCAGAGGTTTGGTCAAGTC
TCCAATCAAGGTTGTCGGCTTGTCTACCTTGCCAGAAATTTACGAAAAGATGGAAAAGGG
TCAAATCGTTGGTAGATACGTTGTTGACACTTCTAAATAAGCGAATTTCTTATGATTTAT
GATTTTTATTATTAAATAAGTTATAAAAAAAATAAGTGTATACAAATTTTAAAGTGACTC
TTAGGTTTTAAAACGAAAATTCTTGTTCTTGAGTAACTCTTTCCTGTAGGTCAGGTTGCT
TTCTCAGGTATAGCATGAGGTCGCTCTTATTGACCACACCTCTACCGGCATGCCGAGCAA 
ATGCCTGCAAATCGCTCCCCATTTCACCCAATTGTAGATATGCTAACTCCAGCAATGAGT 
TGATGAATCTCGGTGTGTATTTTATGTCCTCAGAGGACAATACCTGTTGTAATCGTTCTT
CCACACGGATCCGCATCAGGCGAAATTGTAAACGTTAATATTTTGTTAAAATTCGCGTTA
AATATTTGTTAAATCAGCTCATTTTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTAT
AAATCAAAAGAATAGACCGAGATAGGGTTGAGTGTTGTTCCAGTTTGGAACAAGAGTCCA 
CTATTAAAGAACGTGGACTCCAACGTCAAAGGGCGAAAAACCGTCTATCAGGGCGATGGC 
CCACTACGTGAACCATCACCCTAATCAAGTTTTTTGGGGTCGAGGTGCCGTAAAGCACTA
AATCGGAACCCTAAAGGGAGCCCCCGATTTAGAGCTTGACGGGGAAAGCCGGCGAACGTG 
GCGAGAAAGGAAGGGAAGAAAGCGAAAGGAGCGGGCGCTAGGGCGCTGGCAAGTGTAGCG 
GTCACGCTGCGCGTAACCACCACACCCGCCGCGCTTAATGCGCCGCTACAGGGCGCGTCC 
CATTCGCCATTCACTGCA
GACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAATAATGGTTT
CTTAGGACGGATCGCTTGCCTGTAACTTACACGCGCCTCGTATCTTTTAATGATGGAATA 
ATTTGGGAATTTACTCTGTGTTTATTTATTTTTATGTTTTGTATTTGGATTTTAGAAAGT
AAATAAAGAAGGTAGAAGAGTTACGGAATGAAGAAAAAAAAATAAACAAAGGTTTAAAAA
ATTTCAACAAAAAGCGTACTTTACATATATATTTATTAGACAAGAAAAGCAGATTAAATA
GATATACATTCGATTAACGATAAGTAAAATGTAAAATCACAGGATTTTCGTGTGTGGTCT
TCTACACAGACAAGATGAAACAATTCGGCATTAATACCTGAGAGCAGGAAGAGCAAGATA 
AAAGGTAGTATTTGTTGGCGATCCCCCTAGAGTCTTTTACATCTTCGGAAAACAAAAACT
ATTTTTTCTTTAATTTCTTTTTTTACTTTCTATTTTTAATTTATATATTTATATTAAAAA
ATTTAAATTATAATTATTTTTATAGCACGTGATGAAAAGGACCCAGGTGGCACTTTTCGG
GGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCG
CTCATGAGACAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGT 
ATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTT
GCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTG 
GGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAA
CGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATT
GACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAG 
TACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGT 
GCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGA 
CCGAAGGAGCTAACCGCTTTTTTTCACAACATGGGGGATCATGTAACTCGCCTTGATCGT
TGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGTA 
GCAATGGCAACAACGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGG 
CAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCC 
CTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGT 
ATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACG 
GGCAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTG 
ATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATATACTTTAGATTGATTTAAAA
CTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAA
ATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGA
TCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCG
CTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACT
GGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCAC 
CACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTG 
GCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCG 
GATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGA 
ACGACCTACACCGAACTGAGATACCTAGAGCGTGAGCATTGAGAAAGCGCCACGCTTCCC 
GAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACG 
AGGGAGCTTCCAGGGGGGAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTC
TGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCCGAGCCTATGGAAAAACGCC
AGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTT
CCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACC 
GCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGC 
CCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGCACGAC 
AGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTACCTCACT 
CATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCCTATGTTGTGTGGAATTGTG 
AGCGGATAACAATTTCACACAGGAAACAGCTATGACCATGATTACGCCAAGCTCGGAATT 
AACCCTCACTAAAGGGAACAAAAGCTGGGTACCGGGCCCCCCCTCGAGATCCGGGATCGA 
AGAAATGATGGTAAATGAAATAGGAAATCAAGGAGCATGAAGGCAAAAGACAAATATAAG 
GGTCGAACGAAAAATAAAGTGAAAAGTGTTGATATGATGTATTTGGCTTTGCGGCGCCGA
AAAAACGAGTTTACGCAATTGCACAATCATGCTGACTCTGTGGCGGACCCGCGCTCTTGC 
CGGCCCGGCGATAACGCTGGGCGTGAGGCTGTGCCCGGCGGAGTTTTTTGCGCCTGCATT
TTCCAAGGTTTACCCTGCGCTAAGGGGCGAGATTGGAGAAGCAATAAGAATGCCGGTTGG 
GGTTGCGATGATGACGACCACGACAACTGGTGTCATTATTTAAGTTGCCGAAAGAACCTG
AGTGCATTTGCAACATGAGTATACTAGAAGAATGAGCCAAGACTTGCGAGACGCGAGTTT 
GCCGGTGGTGCGAACAATAGAGCGACCATGACCTTGAAGGTGAGACGCGCATAACCGCTA 
GAGTACTTTGAAGAGGAAACAGCAATAGGGTTGCTACCAGTATAAATAGACAGGTACATA 
CAACACTGGAAATGGTTGTCTGTTTGAGTACGCTTTCAATTCATTTGGGTGTGCACTTTA
TTATGTTACAATATGGAAGGGAACTTTACACTTCTCCTATGCACATATATTAATTAAAGT
CCAATGCTAGTAGAGAAGGGGGGTAACACCCCTCCGCGCTCTTTTCCGATTTTTTTCTAA
ACCGTGGAATATTTCGGATATCCTTTTGTTGTTTCCGGGTGTACAATATGGACTTCCTCT
TTTCTGGCAACCAAACCCATACATCGGGATTCCTATAATACCTTCGTTGGTCTCCCTAAC 
ATGTAGGTGGCGGAGGGGAGATATACAATAGAACAGATACCAGACAAGACATAATGGGCT 
AAACAAGACTACACCAATTACACTGCCTCATTGATGGTGGTACATAACGAACTAATACTG 
TAGCCCTAGACTTGATAGCCATCATCATATCGAAGTTTCACTACCCTTTTTCCATTTGCC
ATCTATTGAAGTAATAATAGGCGCATGCAACTTCTTTTCTTTTTTTTTCTTTTCTCTCTC
CCCCGTTGTTGTCTCACCATATCCGCAATGACAAAAAAAATGATGGAAGACACTAAAGGA 
AAAAATTAACGACAAAGACAGCACCAACAGATGTCGTTGTTCCAGAGCTGATGAGGGGTA 
TCTTCGAACACACGAAACTTTTTCCTTCCTTCATTCACGCACACTACTCTCTAATGAGCA
ACGGTATACGGCCTTCCTTCCAGTTACTTGAATTTGAAATAAAAAAAGTTTGCCGCTTTG
CTATCAAGTATAAATAGACCTGCAATTATTAATCTTTTGTTTCCTCGTCATTGTTCTCGT
TCCCTTTCTTCCTTGTTTCTTTTTCTGCACAATATTTCAAGCTATACCAAGCATACAATC
AACTCCAAGCTTATGCCCAAGAAGAAGCGGAAGGTCTCGAGCGGCGCCAATTTTAATCAA
AGTGGGAATATTGCTGATAGCTCATTGTCCTTCAGTTTCACTAACAGTAGCAACGGTCCG 
AACCTCATAACAACTCAAACAAATTCTCAAGCGCTTTCACAACCAATTGCCTCCTCTAAC 
GTTCATGATAACTTCATGAATAATGAAATCACGGCTAGTAAAATTGATGATGGTAATAAT 
TCAAAACCACTGTCACCTGGTTGGACGGACCAAACTGCGTATAACGCGTTTGGAATCACT 
ACAGGGATGTTTAATACCACTACAATGGATGATGTATATAACTATCTATTCGATGATGAA 
GATACCCCACCAAACCCAAAAAAAGAGGGTGGGTCGATCACAAGTTTGTACAAAAAAGCA 
GGCTTGTCGACCCCGGGAATTCAGATCTACTAGTGCGGCCGCACGCGTACCCAGCTTTCT 
TGTACAAAGTGGTGACGTCGAGCTCTAAGTAAGTAACGGCCGCCACCGCGGTGGAGCTTT
GGACTTCTTCGCCAGAGGTTTGGTCAAGTCTCCAATCAAGGTTGTCGGCTTGTCTACCTT 
GCCAGAAATTTACGAAAAGATGGAAAAGGGTCAAATCGTTGGTAGATACGTTGTTGACAC
TTCTAAATAAGCGAATTTCTTATGATTTATGATTTTTATTATTAAATAAGTTATAAAAAA
AATAAGTGTATACAAATTTTAAAGTGACTCTTAGGTTTTAAAACGAAAATTCTTGTTCTT
GAGTAACTCTTTCCTGTAGGTCAGGTTGCTTTCTCAGGTATAGCATGAGGTCGCTCTTAT
TGACCACACCTCTACCGGCATGCCGAGCAAATGCCTGCAAATCGCTCCCCATTTCACCCA 
ATTGTAGATATGCTAACTCCAGCAATGAGTTGATGAATCTCGGTGTGTATTTTATGTCCT 
CAGAGGACAATACCTGTTGTAATCGTTCTTCCACACGGATCCCAATTCGCCCTATAGTGA 
GTCGTATTACAATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGT 
TACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGA 
GGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATGGACGCGCCC 
TGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTT 
GCCAGCGCCCTAGCGCCCGCTCCTTTGGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCC
GGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTA
CGGCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCC
TGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCAGGTTCTTTAATAGTGGACTCTTG 
TTCCAAACTGGAACAACACTCAACCCTATCTCGGTCTATTCTTTTGATTTATAAGGGATT
TTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAAAATTTAACGCGAAT
TTTAACAAAATATTAACGTTTACAATTTCCTGATGCGGTATTTTCTCCTTACGCATCTGT
GCGGTATTTCACACCGCAGGCAAGTGCACAAACAATACTTAAATAAATACTACTCAGTAA 
TAACCTATTTCTTAGCATTTTTGACGAAATTTGCTATTTTGTTAGAGTCTTTTACACCAT
TTGTCTCCACACCTCCGCTTACATCAACACCAATAACGCCATTTAATCTAAGCGCATCAC
CAACATTTTCTGGCGTCAGTCCACCAGCTAACATAAAATGTAAGCTTTCGGGGCTCTCTT
GCCTTCCAACCCAGTCAGAAATCGAGTTCCAATCCAAAAGTTCACCTGTCCCACCTGCTT 
CTGAATCAAACAAGGGAATAAACGAATGAGGTTTCTGTGAAGCTGCACTGAGTAGTATGT
TGCAGTCTTTTGGAAATACGAGTCTTTTAATAACTGGCAAACCGAGGAACTCTTGGTATT
CTTGCCACGACTCATCTCCATGCAGTTGGACGATATCAATGCCGTAATCATTGACCAGAG 
CCAAAACATCCTCCTTAGGTTGATTACGAAACACGCCAACCAAGTATTTCGGAGTGCCTG
AACTATTTTTATATGCTTTTACAAGACTTGAAATTTTCCTTGCAATAACCGGGTCAATTG
TTCTCTTTCTATTGGGCACACATATAATACCCAGCAAGTCAGCATCGGAATCTAGAGCAC 
ATTCTGCGGCCTCTGTGCTCTGCAAGCCGCAAACTTTCACCAATGGACCAGAACTACCTG 
TGAAATTAATAACAGACATACTCCAAGCTGCCTTTGTGTGCTTAATCACGTATACTCACG 
TGCTCAATAGTCACCAATGCCCTCCCTCTTGGCCCTCTCCTTTTCTTTTTTCGACCGAAT
TAATTCTTAATCGGCAAAAAAAGAAAAGCTCCGGATCAAGATTGTACGTAAGGTGACAAG
CTATTTTTCAATAAAGAATATCTTCCACTACTGCCATCTGGCGTCATAACTGCAAAGTAC 
ACATATATTACGATGCTGTCTATTAAATGCTTCCTATATTATATATATAGTAATGTCGTT 
TATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGACACC 
CGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGAC 

AAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCGAAAC 
GCGCGA 